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A  SINGLE  PERIOD  MODEL  FOR  A  MULTIPRODUCT  PERISHABLE 
INVENTORY  SYSTEM  WITH  ECONOMIC  SUBSTITUTION 


Bryan  L.  Deuermeyer 

Department  of  Industrial  Engineering 
Texas  A&M  University 
College  Station,  Texas  77843 

ABSTRACT 

This  paper  develops  a  single  period  model  for  a  specific  class  of  multipro¬ 
duct  perishable  inventory  systems  where  demands  are  interdependent.  This 
class  of  inventory  systems  has  the  property  that  there  is  economic  substitution 
between  products.  It  is  shown  that  the  optimal  policy  has  the  economic  substi¬ 
tution  property,  and  that  the  rate  of  substitution  is  age  dependent.  The  model 
serves  as  a  generalization  of  a  theorem  discovered  by  Ignall  and  Veinott. 


1.  INTRODUCTION 


This  paper  considers  the  problem  of  characterizing  the  properties  of  optimal  ordering  poli¬ 
cies  for  a  specific  class  of  single  period,  multiproduct,  perishable  inventory  systems  where  there 
may  be  dependencies  between  the  stochastic  product  demands.  It  is  assumed  that  there  are  n 
products  and  m  demand  classes  for  these  products  and  that  each  product  has  a  known  fixed 
(and  finite)  lifetime.  As  is  usual,  the  criterion  for  choosing  an  optimal  ordering  policy  is  the 
minimization  of  total  expected  costs.  Pertinent  costs  are:  linear  purchasing  (production)  and 
outdating  (disposal)  costs  and  convex  holding  and  shortage  costs. 

The  special  class  we  consider  is  determined  by  the  property  called  economic  substitution. 
By  this,  it  is  meant  that  the  order  quantity  of  product  /(/  =  1,2,.  .  .,n)  is  a  nonincreasing  func¬ 
tion  of  the  on-hand  inventories  of  the  other  products.  In  effect,  some  amount  of  product  i  can 
be  replaced  by  an  increase  in  the  amounts  of  other  products  kept  on  hand,  in  terms  of  the 
economic  benefits  realized  by  the  firm.  This  phenomenon  typically  occurs  because  space  and 
capital  allocated  to,  say,  product  i  must  be  adjusted  (downward)  when  increasing  the  inventory 
commitment  to  another  product.  It  is  important  to  emphasize  that  the  phrase  "economic  substi¬ 
tution"  as  it  is  used  here  does  not  imply  that  another  product,  say  j,  can  be  substituted  for  pro¬ 
duct  i  to  satisfy  a  demand  for  product  /.  (The  latter  would  correspond  to  the  classical  economic 
interpretation.) 

The  model  developed  here  would  find  applications  in  the  retail  food  industry,  photo¬ 
graphic  film  industry,  pharmaceutical  industry,  and  finally  in  hospital  and  regional  blood  banks. 

Our  focus  is  to  characterize  the  properties  of  the  optimal  policy  when  economic  substitu¬ 
tion  is  in  effect.  We  show  in  Section  3  that  the  optimal  starting  inventory  (on-hand  plus 
amount  ordered)  is  in  general  more  responsive  to  changes  in  newer  inventory  than  older 
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inventory.  This  phenomenon  carries  over  to  economic  substitution.  That  is,  the  rate  of  substi¬ 
tution  is  age  dependent. 

The  study  of  perishable  inventory  theory  has  been  an  active  research  area  in  recent  years. 
Van  Zyl  [11]  provided  the  first  major  study  on  single-product  perishable  inventories,  but  the 
field  only  began  to  develop  after  the  work  of  Fries  [51  and  Nahmias  [7],  These  three  papers  all 
dealt  with  the  determination  of  optimal  policies.  These  policies  were  very  complicated  to 
implement,  particularly  from  a  computational  standpoint.  This  motivated  Cohen  [2]  to  develop 
an  approach  for  determining  optimal  single  critical  number  policies  that  were  easy  to  use.  Cha- 
zan  and  Gal  [11  proved  a  conjecture  made  in  [2]  that  the  expected  number  of  outdates  is  a  con¬ 
vex  function.  More  recently,  Nahmias  [9]  developed  an  approach  that  allows  relatively  easy 
computation  of  myopic  approximate  policies. 

Very  little  work  has  been  done  in  the  area  of  multiproduct  perishable  inventory  theory. 
Nahmias  and  Pierskalla  [10]  considered  a  two-product  perishable/nonperishable  model  which  is 
to  some  extent  applicable  to  whole  blood/frozen  blood  inventories,  and  to  foodstuffs  such  as 
milk  and  dry  milk.  Deuermeyer  [31  considered  a  two-product  perishable  inventory-production 
model  where  different  production  processes  must  be  coordinated  to  make  the  inventory  items. 
The  objective  of  this  paper  is  to  characterize  the  age-dependent  relationships  between  products. 

Several  papers  have  been  written  in  the  area  of  multiproduct  (nonperishable)  inventory 
theory.  A  summary  of  early  research  can  be  found  in  the  excellent  survey  by  Veinott  [12]. 
Deuermeyer  and  Pierskalla  [4]  considered  a  two-product  inventory-production  system  where  a 
by-product  process  and  a  single-item  production  process  must  be  coordinated  to  manufacture 
the  two  products.  Ignail  and  Veinott  [6]  originated  the  concept  of  economic  substitution  that 
we  use  here. 

The  article  is  organized  as  follows:  Section  2  provides  the  notation  and  assumptions  upon 
which  the  model  is  based;  Section  3  provides  a  detailed  discussion  of  the  optimal  policy  for  the 
general  one-period  model,  and  the  special  case  for  n  —  2  is  presented  as  an  example. 

2.  NOTATION  AND  ASSUMPTIONS 

Let  the  number  of  products  be  n  and  the  number  of  demand  classes  be  m.  Then  the  vec¬ 
tor  random  variable  D  =  iDx,Di,.  .  .,Dm)  is  the  joint  demand  during  the  period.  We  assume 
that  the  range  of  D  is_a  Borel  set  D.  From  D  we  form  the  product  demand  vector 
D  -  (5,,.  . ., D„ )  where  D,  is  the  total  demand  for  product  i  during  the  period.  Let  /,  be  the 
fixed  lifetime  for  product  i,  i  €  /  —  [1,2,.  . Let  /  =  l\  +  l2  +  ...  +  /„-  n.  I  is  the 
dimension  of  the  statespace  of  the  problem. 

The  following  assumptions  and  additional  notation  are  required: 

1.  I,  is  integral  valued  and  1  </,<«»,/€/. 

2.  All  stock  is  issued  to  meet  demand  according  to  a  FIFO  policy  (that  is,  oldest  first). 

3.  D  has  a  continuous  density  of  /  and  distribution  /such  that  /(r)  is  zero  except  on 
the  positive  orthant  of  E„  (Euclidean  ff-space). 

4.  All  units  are  fresh  when  they  enter  stock  and  there  is  no  delivery  lead  time. 

5.  The  vector  of  total  starting  (on-hand  plus  amount  ordered  and  before  demand) 
inventories  z  must  lie  in  a  topologically  closed  set  Z. 

6.  Let  Liz),  z  6  Z  be  the  expected  inventory  holding  and  shortage  cost  function. 
Specific  assumptions  concerning  L(  )  will  be  made  subsequently. 
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7.  Let  c,  be  the  proportional  cost  per  unit  of  product  /  purchased,  /'  €  /. 

8.  Let  9,  be  the  proportional  cost  charged  against  each  unit  of  product  /  that  outdates, 
/  6  /.  The  exact  method  for  charging  these  costs  will  be  clarified  shortly. 

Let  Xj,  be  the  initial  inventory  (already  on  hand  at  the  beginning  of  the  period)  of  age 

j,  j  €  [/]  “  {1,2,.  -  1)  of  product  /,  /'  €  /.  Further,  let  x,  -  (xn . be  the  vector 

of  initial  inventories  of  products  /  by  age,  /  6  /.  The  notation  x  will  be  reserved  for  the  vector 
x  =  (x1(x2,.  •  -,x„)  of  inventories  by  age.  In  addition,  we  define 

*,  -  £  xu>  •  €  ’• 

/  €  (f  I 

which  is  the  total  amount  of  product  /  on  hand  prior  to  ordering.  Finally  let  x  -  (*[ . x„). 

Once  again  x  refers  to  inventory  by  age  while  x  simply  refers  to  cumulative  stock. 

Let  v,  >  0  be  the  amount  of  product  i  ordered,  i  6  /,  and  y  —  0'i>-  ■  ■•yj  be  the  vector  of 

order  quantities.  Then,  the  vector  of  starting  inventories  (after  ordering  but  before  demand)  is 
given  by  z  =  x  +  y.  We  require  that  z  €  Z 

Let  a  and  b  be  vectors  in  E„.  a  <  b  means  that  a,  <  b,,  i  6  /,  and  a  <  b  means  that 
a  <  b  but  a,  <  b,  for  some  i.  Similarly,  a  <  b  means  that  a,  <  bh  i  6  I.  Let  [ a,b ]  “ 

(x  €  a,  <  x,  <  b,,  i  €  /).  [a,b\  is  called  a  closed  rectangle.  Finally,  let  E‘  be  the  extended 

Euclidean  space.  If  H  is  a  matrix,  then  \H\  is  the  determinant  of  H.  Let  H  be  a  square 
matrix.  Hik  is  the  matrix  formed  from  H  by  deleting  row  i  and  column  k  and  H‘k  is  the  matrix 
formed  from  H  by  interchanging  columns  /  and  k. 

Let  <f>  be  a  function  defined  on  £„  that  is  twice  continuously  differentiable  over  £„.  Then 
we  write 

D,  <b  (x, . x„)  -=  - —  <b  (x, . x„),  /  *=  1,  2,  . . .  ,  n, 

ax, 

and 

Du  <t>  (x,,.  .  .,x„)  =  —  D,  <t>  (x, . x„),  I,  j  -  1,  2,  ....  n. 

OXj 

We  let  V2  <t>  (x)  be  the  Hessian  matrix  of  4>(  )  evaluated  at  x.  Also,  let  t  be  fixed  and  let 
y  -  (vj,.  ...y,)  and  x  =  (y,+|1.  ..,>„).  Then,  V2d»  0>,x)  is  the  Hessian  of  <t>  restricted  to  the 
first  /  variables. 

We  follow  the  method  of  Nahmias  [7]  for  assigning  the  outdate  cost,  that  is,  the  outdate 
cost  0,  is  charged  against  the  expected  amount  of  the  order  y,  that  outdates  /,  periods  into  the 
future,  given  x,  on  hand.  Then  the  appropriate  cost  function  is  given  by 

V,  Cv,,x,)  =  9,  fQ  G,  ( u,x,)du , 

where  G,  (-,x,)  is  computed  recursively  using  the  marginal  distribution  £,(•)  of  the  total 
demand  for  product  /.  For  the  complete  derivation  of  the  "  V"  functions  and  their  properties, 
see  [7],  and  for  computational  results  when  demands  are  Erlang,  see  [8]. 

The  motivation  for  this  approach  is  that  the  full  impact  of  outdating  is  taken  into  account 
at  the  time  the  order  is  placed.  This  approach  is  particularly  attractive  when  studying  single- 
period  inventory  models  as  we  do  in  Section  3,  since  the  one-period  optimal  solution  would 
otherwise  ignore  outdating  entirely. 
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An  important  concept  used  extensively  throughout  this  paper  is  that  of  a  substitute 
matrix,  first  defined  by  Ignall  and  Veinott  [6].  Let  H(nxrt)  be  a  nonnegative  symmetric  posi¬ 
tive  definite  matrix.  Let  H'J  be  the  matrix  formed  from  H  by  interchanging  columns  i  and  j, 
where  i  <  j.  Then  we  say  that  H  is  a  substitute  matrix  if  every  principal  minor  of  //"  that  con¬ 
tains  elements  from  only  one  of  the  columns  /  or  j  is  nonnegative.  We  let  S„  be  the  class  of  all 
substitute  matrices  of  order  «.  When  n  —  2,  S„  consists  of  all  positive  definite  symmetric 
matrices  with  nonnegative  off-diagonal  elements.  Unfortunately,  S„  is  closed  under  addition 
only  when  n  -  2  (see  16]  for  examples) . 

3.  ANALYSIS  OF  OPTIMAL  POLICIES 

The  primary  aim  of  this  article  is  to  show  that  certain  assumptions  will  be  sufficient  for 
the  optimal  policy  to  have  the  economic  substitution  property.  In  addition,  we  will  demonstrate 
the  additional  properties  of  the  optimal  policy  that  are  obtained  as  a  consequence  of  perishabil¬ 
ity. 


We  define  the  term  optimal  policy  to  be  the  specific  choice  of  starting  inventory  level  that 
leads  to  the  minimum  expected  total  cost.  If  such  a  policy  exists,  we  denote  it  as  z(x)  — 
(z,(x),.  .  .,z„(x)).  Thus,  z(x)  solves 

(3.1)  fi(z(x);x)  -  inf  B{z\x),  x  €  E, 

:tZ 

with 

B(z,x)  =  G(z )  +  £  V,(.Zi  -  x,;x)t 


G(z)  -  £  +  L  (z),  z€Z. 

a  i 


We  propose  the  following  two  postulates: 

(Al)  Assume  that  V.  5(z;x)  is  a  substitute  matrix  for  each  x  6  Eh 

(A2)  Assume  that  all  sets  A  (s)  —  [z  6  Z;  G(z)  <  s)  are  compact  for  each  bounded 
real  number  s. 

The  assumption  that  V?  fl(z;x)  be  a  substitute  matrix  is  only  slightly  more  restrictive 
than  requiring  that  B{- ;x)  be  strictly  convex  with  nonnegative  crosspartial  derivatives.  The 
latter  occurs  whenever  the  marginal  total  costs  of  product  i  are  nondecreasing  functions  of  the 
inventories  of  all  other  products.  This  assumption  turns  out  to  be  sufficient  for  the  optimal 
ordering  policy  to  have  the  economic  substitution  property.  Postulate  (A2)  will  hold  under  a 
number  of  conditions  provided  postulate  (Al)  is  satisfied.  If,  for  example,  one  of  the  sets 
A  (s)  is  bounded  for  some  s',  it  is  a  property  of  convex  functions  that  A  (s)  will  be  bounded  for 
every  s.  A  sufficient  condition  is  to  require  that  G(j)  — » <»  whenever  ||>>||  —  «>,  where  INI 
is  the  Euclidean  norm  on  E„.  Finally,  the  compactness  will  follow  by  the  boundedness  of  A  (•) 
and  the  continuity  of  G(  ).  Postulate  (A2)  assures  that  the  finite  optimal  policy  exists  and  that 
"inf  can  be  replaced  by  "min"  in  Eq.  (3.1). 


tNolice  that  in  our  definition  of  we  have  not  included  the  constant  -  J  <yv,.  This  is  justified  since  the  con¬ 

stant  does  not  affect  the  optimization. 
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The  central  result  of  this  article  is  Theorem  3.1.  It  generalizes  Theorem  6  in  Ignall  and 
Veinott  [6]  to  perishable  inventory  products. 

THEOREM  3.1  If  postulates  (Al)  and  (A2)  hold  and  Z  =  [a,b]  Q  E‘,  a  unique  z(x) 
exists  and  is  continuously  differentiable.  In  addition,  suppose  z,  (x )  >  x,  for  /'  €  /'  c  /,  and 
z,(x)  —  x,  otherwise.  Let  /  6  /',  A:  €  /,  and  m  £  [k]  -  {2}.  Then,  the  optimal  policy  has  the 
following  properties: 


(PI) 


(P2) 


Bz, 

dxk 


>  0.  k  =  i. 


<  0,  k  &  i, 


(P3) 

(P4) 

(P5) 

where  z,  =  z,(x). 


Bz, 


Bx, 


km  - 1 


Bz, 

Bx, 


km 


<  0,  k  =  i. 


>  0,  k  €  /',  k  /, 


-  0,  k  <7  /', 


Before  proving  Theorem  3.1,  it  is  important  to  interpret  the  five  properties  of  the  optimal 
policy.  Property  (P2)  shows  that  the  optimal  policy  has  the  economic  substitution  property;  the 
optimal  starting  inventory  of  product  /  is  a  nonincreasing  function  of  the  initial  inventory  of 
product  k,  k  ^  i.  This  is  sufficient  to  prove  that  the  ordering  quantity  for  product  i  is  nonin¬ 
creasing.  This  result  is  the  analog  of  the  assertion  of  Theorem  6  in  Ignall  and  Veinott  [6],  Pro¬ 
perties  (PI)  and  (P3MP5)  provide  the  additional  properties  that  arise  from  perishability. 

Properties  (PI)  and  (P3)  provide  the  extension  of  earlier  work  in  perishable  inventory 
theory  on  single-product  models  to  multiproduct  situations.  Property  (PI)  shows  that  the 
optimal  starting  inventory  of  a  product  is  a  nondecreasing  function  of  its  initial  inventory.  That 
is,  the  ordering  quantity  strictly  decreases  when  initial  inventory  increases,  but  not  enough  to 
offset  the  increase.  Property  (P3)  shows  that  the  optimal  starting  inventory  of  a  product  is,  in 
general,  more  sensitive  to  changes  in  younger  units  than  in  older  units.  These  two  results  were 
discovered  by  Fries  (51  and  Nahmias  [7]  in  the  single-product  case. 

Properties  (P4)  and  (P5)  demonstrate  that  the  economic  substitution  property  is  also 
affected  by  making  changes  in  the  age  distribution  of  the  units  in  stock.  Property  (P4)  shows 
that  the  rate  of  substitution  of  product  k  for  product  /  is  more  sensitive  to  increases  in  the 
younger  units  of  product  k  than  in  older  units.  However,  Property  (P5)  shows  that  this  age- 
dependent  rate  of  substitution  only  holds  for  the  products  that  have  been  ordered— not  all  pro¬ 
ducts. 


PROOF:  The  logic  required  to  prove  Theorem  3.1  is  essentially  that  used  by  Ignall  and 
Veinott  (6],  except  that  we  need  to  account  for  the  fact  that  the  state  variable  is  x,  not  x,  and 
the  objective  function  depends  on  the  state  variable  through  the  outdating  cost  function.  All 
that  is  needed  to  make  their  proof  apply  to  the  present  theorem  is  to  generalize  their  Lemma  7, 
so  that  it  incorporates  the  above  information.  Once  this  is  accomplished,  their  proof  only  needs 
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minor  changes.  For  this  reason,  we  do  not  describe  a  proof  of  Theorem  3.1  here,  but  instead 
focus  on  the  proof  of  the  new  lemma,  Lemma  3.1. 

Q.E.D. 

Before  we  can  state  the  lemma,  some  additional  notation  is  required.  Let  t  be  fixed  and 
1  <  '  <  n,  and  let  /',  j  €  /.  Let  h,,  -  Dn  B(z\x),  h ,  —  ..,/r„),  and  H  be  the  matrix  with 

rows  h,.  Also,  let  H,  be  the  matrix  formed  from  H  by  replacing  column  /  by  h'k  ('  denotes  the 
transpose  operation). 

Now,  let  x  -  (x,+) . x„ )  and  let  ^ (x)  -  0, (x),>^2(x) . >>,(x))  solve 

SCvfxl.xix)  =  min  B(y,x;x). 


Also,  let 


u,(m)  = 


flj'(x) 

bxkm 


m  €  [/cl,  and  u(m)  —  (u,(m)„  ..,ut(m)Y. 


LEMMA  3.1  (Generalization  of  Lemma  7  in  Ignall  and  Veinott  [6]):  Let  y  =  y(x). 
1.  Let  1  <  i  <  t,  m  €  [/l.  Then, 

(a.  g-  >  0. 

9  v,  by, 

(b) 

Oxim- 1  Qxim 


2.  Let  1  <  i  t,  1  <  k  <  rt,  k  ^  i,  and  m  €  [Ar].  Then, 
(a)  ^  <  0, 


(b) 


bxkm  ~ 

ay, 


By, 


bxkm  ~ 


>  0,  m  &  2. 


3.  Let  1  <  /  <  t,  t  +  1  <  k  <  n,  m  €  [A:]  —  (2).  Then, 
by,  by, 


bx. 


km  - 1 


bx, 


0. 


km 


4.  Let  i  +  1  <  i  <  n,  t  +  1  <  k  <  n,  m  €  [Ac].  Then, 

a 


ax* 


D )  B(y(x) ,x\x)  >  0. 


The  proof  of  Lemma  3.1  requires  a  lemma.  This  we  put  first. 

LEMMA  3.2  Let  1  <  /  <  n,  \  <  k  <  n,  m  €  [Ac],  Then, 

9  (0,  k  ^  i 

where  ^km  -  D,,„  +  l  Vk (zk  -  x*;x*)  -  Du  Vk(zk  -  xk\xk). 


-% 
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PROOF:  We  simply  state  the  relevant  partial  derivatives  of  B : 

D,  B(z;x)  =  D,  G(z)  +  D,F,(z,  -  ^;x,), 

D„  B(z\x)  =  Dk  G{z)  +  Dn  V,(z,  -  x,;x,), 

D,n  S(z;x)  —  Dik  G(z),  /c  ^  /, 
g  |0,  k  &  i 

xkn  D'  S(r,x)  =  r°u  KU  -  *,;x,)  +  0lm+1K,(z,  -  x,;x ,) ,  k  =  /. 


Q.E.D. 


PROOF  (LEMMA  3.1):  Fix  i  and  such  that  1  <  i  <  t,  1  <  k  <  t,  and  let  m  €  [&]. 
Also,  let  ek  be  the  r-dimensional  unit  vector  with  a  1  in  position  k.  Then,  using  Lemma  3.2  it 
can  be  shown  that 


Hu{m)  =  \km  ek 


so  that 


j-A*„'|//&l/|J/l.  /  *  k 
"/U)  =  1  A*.  \Hkk\/\H\.  i  =  k. 

Now  \H\  >  0,  I Hkk |  >  0,  \Hkk\  >  0  from  postulate  (Al)  and  by  properties  of  substitute 
matrices.  Finally,  Akm  ^  0  from  Nahmias  [7],  This  establishes  1  (a) .  Now, 

|U>lm+1  Vk  —  D\m  Vk]  ■  \H&\/\H\,  i  *  k 
U,im  ~  1)  \DXm  Vk  —  Dlm+1  Vk\  ■  \Hkk\/\H\,  i  =  k, 

where  Vk  -  Vk(zk{\)  -  xk\xk). 


Therefore, 


>  0 ,  i  ^  k 

“i(m  -  1)  -  u,(m)  <  Q  j  =  k 


due  to  results  in  [7],  This  establishes  1(b)  and  2(b). 

Now,  fix  i  and  k  such  that  1  <  /  <  /  and  /  +  1  ^  k  <  n,  and  let  m  e  [ArJ.  Then,  Ignall 
and  Veinott  [6]  show  that 

u,(m )  =  -  \H,\/\H\  <  0. 

But,  this  relation  also  shows  that  u,(m  -  1)  -  Uj(m)  =  0.  This  establishes  2(a)  and  3. 

Now,  fix  i  and  /rsuch  that  t  +  1  <  i,  k  <  n,  and  let  m  6  [£].  Also,  let 
v(m)  =  ~  ■  D,  B  (y(x),3c;x). 

°xkm 

Then,  analogously  to  (6],  we  obtain  that 


v(m)  = 


k  hik 


'  V  iff 
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Therefore,  v(/m)  is  nonnegative  (positive  if  k  =»  i),  independent  of  m.  This  establishes  4  and 
completes  the  proof. 

Q.E.D. 

An  interesting  special  case  corresponds  to  n  =  2.  In  this  case,  it  is  possible  to  graph  the 
decision  regions  and  observe  the  economic  substitution  and  the  impact  of  perishability  on  the 
ordering  policy.  The  optimal  policy  is  defined  by  four  regions  /?,,  R2,  R ij,  and  R0.  These  in 
turn  are  defined  by  a  point  z*  and  two  functions  (x2)  and  y2  (x ,) .  The  point  z*  —  (z‘,z2)  is 
the  minimum  of  GO  over  Z.  y((x2)  is  defined  as  the  solution  to 

G(l)  Cyi(x2),x2)  =  0  for  x2  >  z2 

and 

yt(x2)  =  z*  for  x2  <  z2. 

Similarly,  _y2(x,)  is  defined  as  the  solution  to 

Ga)  (x,,j'2(x1))  =  0  for  xi  >  Z|* 
and 

y2(x,)  =  z2  for  x,  <  z,‘. 

The  following  two  corollaries  to  Theorem  3.1  provide  additional  properties  r  .'  the  optimal 
policy  in  this  special  case. 

COROLLARY  3.1:  If  z(x)  >  x,  then  z(x)  <  z*.  (This  rule«  out  the  possibility  of 
z(x)  =  z*.) 

COROLLARY  3.2:  The  regions  characterizing  the  optimal  policy  are  defined  as  follows: 

1.  x  €  R i  =  (x  €  £,;x,  <  yi(x2),x2  a  z2!  iff  z2(x)  =  x2  and 
x,  <  z i (\ )  <  y|(x2). 

2.  x  €  R2  —  (x  6  £,;x2  <  y2(X|),X(  >  z*)  iff  z,(x)  —  X|  and 
x2  <  z2(x)  <  y2(x,). 

3.  x  6  R 12  =  (x  €  £,;x  <  z*)  iff  x  <  z(x)  ^  z*. 

4.  x  €  R0  =  (R,  (J  R2  (J  R12);  iff  z (x )  =  x. 

When  /|  —  /2  —  2,  the  results  of  the  two  corollaries  can  be  depicted  as  in  Fig.  1.  Notice 
that  z |  (-)  and  z2(  )  will  be  nonincreasing. 


4.  CONCLUSIONS 

The  model  presented  and  studied  in  this  article  extends  the  work  by  Ignall  and  Veinott  (6) 
to  the  perishable  product  case.  The  primary  result  in  this  paper  shows  that  when  the  products 
are  perishable,  the  optimal  policy  has  age-dependent  economic  substitution,  under  the  assump¬ 
tions  given.  As  we  show  in  a  subsequent  work,  the  policies  developed  here  provide  a  con¬ 
venient  basis  for  constructing  approximately  optimal  policies  for  the  dynamic  problem. 
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ABSTRACT 

Previous  methods  for  solving  the  nonlinear  one-parametric  linear  program¬ 
ming  problem  min  [c<t)Tx  | Ax  =  b,  x  >  0)  for  r  €  |a,/3]  were  based  on  the 
simplex  method  using  a  considerably  extended  tableau.  The  proposed  method 
avoids  such  an  extension.  A  finite  sequence  of  feasible  bases  ( Bk  |  k  = 

1.  2 . r )  optimal  in  l/*,/*+il  for  k  =  1,  2,  . . .  ,r  with  a  =  /,  <  r2  <  .  .  .  < 

r,„i  =  fi  is  determined  using  the  zeroes  of  a  set  of  nonlinear  functions.  Com¬ 
putational  experience  is  discussed  in  the  special  case  of  /-norm  transportation 
problems. 


1.  INTRODUCTION 


Linear  parametric  programming  has  attracted  many  mathematicians  as  for  example  can  be 
seen  from  the  large  reference  list  in  Nozicka  et  al.  [3].  On  the  other  hand  there  are  only  few 
results  known  for  linear  optimization  problems  with  a  nonlinear  parameter.  Carpentier  [I I, 

Sarkisjan  [7]  and  Weinert  [9]  discuss  linear  optimization  problems  with  a  nonlinear  parameter 
in  the  objective  function.  They  consider  the  set  of  feasible  solutions 

(1.1)  P:=  {x  <E  1R"  |  Ax  -  b,  x  ^  0) 

with  real  m  x  n  matrix  A  of  rank  m  and  real  positive  w-vector  b.  For  simplicity  of  the  discus¬ 
sion  it  is  always  assumed  that  P  is  nonempty,  bounded  and  nondegenerate.  Let  /  =  [a,  fi]  with 
a,fi  6  IR,  a  <  fi  and  let  c:  /  —  IR"  a  continuous  function  Unbounded  intervals  can  be  con¬ 
sidered  in  the  same  way  with  minor  changes  in  the  assumptions.  Then  the  linear  optimization 
problem  with  nonlinear  parameter  in  the  objective  function  is  j 

(1.2)  min  {c(r)rx  |  x  €  P)  (t  6  /).  j 

In  order  to  find  for  every  t  €  /  an  optimal  solution  x(t)  it  is  sufficient  to  consider  the 
finite  set  P  of  all  bases  corresponding  to  basic  solutions  of  P.  A  finite  sequence  (Bk 

|  Ac  =  1.2.  ...,/•)  of  feasible  bases  Bk  optimal  in  t  6  [f*,r*,+|)  for  k  =  1,  2 . r  with  a  =  r,  <  ; 

t2  <  . . .  <  tr+ 1  =  fi  is  called  a. finite  optima l  solution  of  (1.2).  J 


In  the  above  mentioned  papers  the  problem  is  solved  by  means  of  the  simplex  method 
using  an  extended  tableau.  A  certain  transformation  of  the  objective  function  in  (1.2)  leads  to 
a  new  problem  of  the  form 


(1.3)  min  \\d  +  D  h(t)]Tx  I  x  €  P) 

with  a  constant  real  vector  d  and  a  constant  real  n  x  k  matrix  D.  Then  (1.3)  is  discussed  and 


fey* 
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solved  using  results  of  linear  parametric  programming.  The  number  of  columns  k  is  equal  to 
the  dimension  of  the  smallest  affine  hyperspace  containing  [cit)  |  /  6  /}  (cf.  [9]).  The  general 
linear  optimization  problem  with  one  nonlinear  parameter  is 

(1.4)  min  {c(/)rx  |  x  €  P(/)] 
with  the  set  of  feasible  solutions 

(1.5)  P(t):  =  {x  e  R"  |  A  (t)x  =  bit),  x  >  0) 

defined  by  a  real  m  x  n  continuous  matrix  function  A  :  /  — 1 •  IRm of  constant  rank  m  and  by  a 
real  positive  continuous  vector  function  b:  /  — ■  IRm.  If  Pit)  is  nonempty,  bounded  and  nonde¬ 
generate  for  all  r  €  /  and  if  the  set  Pit)  of  all  bases  with  respect  to  t  €  /  is  identical  for  all  /€/ 
then  (1.4)  can  be  solved  in  the  same  manner  as  (1.2).  If  all  component  functions  are  polyno¬ 
mials  then  a  solution  method  can  be  found  in  Ritter  [51 .  In  [5]  the  solution  method  is  based 
on  the  optimality  conditions  of  linear  programming.  This  approach  seems  to  be  more  direct 
and  avoids  the  determination  of  the  transformed  problem  (1.3). 

In  particular,  we  are  interested  in  integer  and  combinatorial  optimization  problems  having 
a  linear  characterization  P  such  that  the  basic  solutions  of  P  are  feasible  solutions  for  the  optim¬ 
ization  problem.  Then  dependence  on  a  continuously  varying  parameter  t  is  considered  only  for 
the  objective  function.  Thus,  parametric  programming  in  this  case  leads  in  a  natural  way  to 
problems  of  the  form  (1.2). 

Therefore,  we  develop  a  method  for  (1.2)  based  on  the  optimality  conditions  of  linear 
programming  in  section  2  and  discuss  shortly  the  possible  extension  to  (1.4)  under  the  above 
assumptions.  In  section  3  the  application  of  this  method  to  transportation  problems  is  discussed 
and  /-norm  objective  functions  that  occur  in  algebraic  transportation  problems  are  treated  (cf. 
[11]).  The  coefficient  functions  are  of  the  form/(/)  =  a'  with  a  €  1R+.  Finally  some  compu¬ 
tational  experience  for  /-norm  transportation  problems  is  given  in  section  4. 

2.  LINEAR  PROGRAMMING  WITH  ONE  NONLINEAR  PARAMETER 

As  mentioned  in  the  introduction  we  consider  only  the  finite  set  of  bases  P.  A  basis 
B  €  P  is  the  index  vector  of  the  basic  variables.  Then  A  denotes  the  index  vector  of  the  non- 
basic  variables.  Partitions  of  vectors  and  matrices  will  be  indexed  by  these  vectors  in  the  usual 
manner.  The  set  of  all  optimal  bases  with  respect  to  /  €  /  is  denoted  by  V(r).  We  define 

Vit’.t"):  =  n  Vit)  for  /',  /"  6  /  with  /'  ^  <". 

Then  for  B  €  P  the  optimality  set  is  KB):  =  {/  €  /  |  BzVit)).  This  set  can  be  characterized 
by  means  of  the  reduced  cost  coefficient  functions 

eft):  =  eft)  -  cH(t)T  Ah  1  A, 

for  j  —  1,2 . n  with  respect  to  a  basis  B.  These  functions  are  continuous  in  /.  Obviously 

cB(t)  =  0.  From  linear  programming  we  know 

(2.1)  LIB)  =  {/  €  /  |  C\  (/)  ^  0|. 

Thus  the  optimality  set  KB)  is  a  closed  subset  of  /.  In  the  case  of  linear  cost  coefficient  func¬ 
tions  c:l IR"  the  parameter  set  KB)  is  a  closed  interval  (cf.  [3]).  In  general  KB)  may  be 
disconnected  (cf.  [1],  [9]).  Then  KB)  is  the  union  of  mutually  exclusive  closed  intervals 
called  optimality  intervals  of  B.  For  /  €  KB)  we  denote  the  optimality  interval  containing  /  by 
/,fi  =  :  [af,  B?].  We  consider  two  sets  of  optimality  intervals  defined  by 

/":  =  [/,«  |  /  e  /  (fill 


ONE-PARAMETRIC  PROGRAMMING  AND  l-NORM  TRANSPORTATION 


189 


for  B  €  P  and  defined  by 

/,:  -  {/*  |  B  €  F(/)} 

for  t  €  /.  Obviously  I,  is  nonempty  and  finite  for  all  t  €  /.  The  existence  of  a  finite  solution 

of  (1.2)  is  clearly  equivalent  to  the  existence  of  a  finite  covering  (I,*  |  k  —  1,2 . r)  of  /.  Due 

to  the  compactness  of  /  the  following  local  characterization  holds. 

(2.2)  Local  Existence  Condition 

For  all  t  €  (a,0]  resp.  t  €  [a,0)  there  exists  B  €  P  resp.  B  €  Pwith 

(2.2.1)  a,®  <  /  <  / 3f  resp. 

(2.2.2)  af  ^  t  <  pf. 

(2.2)  is  necessary  and  sufficient  for  the  existence  of  a  finite  optimal  solution.  Necessity  is  obvi¬ 
ous  and  sufficiency  follows  by  a  compactness  argument.  We  prefer  to  prove  sufficiency  in  a 
constructive  way.  Let 

(2.3)  h  :  -  a 

tk  :  =  max  (0 |  B  €  V(t *_i)]  for  k  =  2,3,  . . . 

If  (2.2)  holds,  then  (2.3)  yields  p  finite  sequence  0k  \  k  =  1, 2,  . . .  ,r  +  1)  with  /,+)  =  0. 
Thus,  this  recursion  provides  a  solution  method  for  (1.2).  To  avoid  the  enumeration  of  VO), 
further  conditions  are  discussed  which  are  stronger  than  (2.2). 

(2.4)  Finite  Number  of  Optimality  Intervals 
Let  lB  be  finite  for  all  B  €  P. 

(2.4)  implies  (2.2).  Furthermore,  if  (2.4)  holds,  then  an  arbitrary  choice  of  the  ’next'  interval 

in  (2.3)  with  tk <  PB  , 

yields  a  finite  sequence.  The  problem  is  to  find  such  a  basis  B  6  V (/*_i).  For  a  discussion  of 
this  problem  we  assume  in  the  following  B  €  K(t)  for  r  E  [a, ft).  Then  we  have  to  check 
r  <  pB.  As  the  optimality  intervals  will  not  be  given  explicitly,  we  must  determine  them  with 
the  aid  of  the  reduced  cost  coefficients.  (2.1)  yields  the  following  characterization: 

(2.5)  0®-  inf  ({r'€  (r,/3j  |  E /:  cNU)  O')  <  0}  U  (B)). 

Numerical  determination  of  the  considered  infimum  seems  to  be  very  critical.  It  can 
easily  be  seen  that  0  f  is  a  zero  for  at  least  one  of  the  nonbasic  reduced  cost  coefficient  func¬ 
tions.  Therefore  numerical  methods  for  the  determination  of  zeros  of  continuous  functions  can 
be  applied.  Unfortunately  (3?  is  in  general  not  an  isolated  zero,  which  complicates  the  situation. 
Let 

Gf:  -  {r  6  /  |  Cj  0)  ■=  0) 

for  j  —  1,2 . n  and  B  €  P.  If  the  following  conditions  holds,  only  isolated  zeros  are  of 

interest. 

(2.6)  Finite  Number  of  Zeros 

Let  Gf  —  /  or  let  Gf  be  a  finite  set  for  all  7  =  1,2 . n  and  all  B  €  P. 
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It  is  easily  shown  that  (2.6)  implies  (2.4).  Furthermore  we  can  replace  (2.5)  by  the  deter¬ 
mination  of  an  isolated  zero  of  the  non  vanishing  reduced  cost  coefficient  functions  c,  ,  j€ 
J(B):  -  {/'  |  G,b  *  /): 

(2.7)  f(B)  :  -  min  ( (J  {G,fln  (r.0]  |  y€  J(B ))  U{/9}). 

Now  let  t'  6  (r,f).  If  cNW)  >  0,  then  [r,fl  is  a  subset  of  an  optimality  interval  of  B.  Other¬ 
wise,  r  -  pf  and  we  have  to  choose  another  solution  B  €  V(t). 

Even  the  calculation  of  the  least  zero  (2.7)  for  a  given  set  of  at  most  n  -  m  nonvanishing 
nonbasic  reduced  cost  coefficient  functions  remains  very  critical.  No  method  is  known  to  the 
authors  that  guarantees  to  find  this  very  zero. 

A  further  problem  is  to  find  a  systematic  way  of  changing  the  current  basis  in  the  case 
r  =  pf.  This  is  solved  with  the  aid  of  the  simplex  method. 

Therefore,  we  summarize  in  the  following  the  changes  due  to  a  pivot  step  of  the  simplex 
method.  The  nonbasic  variable  xN is  exchanged  for  the  basic  variable  xBlr).  The  new  solu¬ 
tion  x  corresponds  to  the  new  basis  2?  defined  by  B(i)  —  B(i)  for  all  /'  z*  rand  B(r)  —  N(s). 
N  is  defined  analogously.  The  new  reduced  cost  coefficient  functions  are  denoted  by  c.  Fur¬ 
thermore,  a  >  o  denotes  the  pivot  element  and  er  denotes  the  r-th  unit  vector  of  Rm.  Then 
from  linear  programming  we  know  the  following  in  parametric  form  for  all  t  6  /: 

(2.8.1)  cB(t)T  ABl  =  [cB  ( t )  +  (c/v(s)  U)/a)  er]TAf1 

(2.8.2)  d#(f)T-  c*(/)r-  (cy(s)0)/a)  ej  Ag'A^ 

(2.8.3)  _  •*«<,■)  ~  (*B(r)/a)  (Agl  /4/v<s))fl(,)  /  r 

Xfi(,)  “  *B(r)/a  i  -  r. 

Let  z(r)  resp.  z(r)  denote  the  values  of  the  objective  function  in  (1.2)  with  respect  to  B  and  B. 
Then 

(2.8.4)  z(i)  =  z(t)  +  (xBU)/a)  ■  cNU)(t) 

for  all  /  €  I.  Due  to  the  assumed  nondegeneracy  we  know  xB(r)  >  0. 

Now  we  perform  a  sequence  of  pivot  steps  in  the  following  way.  If  B  ^  K(r,f)  (cf.  2.7) 
then  there  exists  a  nonbasic  variable  x/v(s>  w«th 

(2.9.1)  c*(s)(r)  =  0 

(2.9.2)  cNM  W)  <  0  for  all  f  6  (r.f). 

Then  xN(s)  is  introduced  into  the  basis.  Due  to  (2.8.2)  and  (2.9.1)  we  find  B  €  V(t). 
Subsequent  pivot  steps  according  to  (2.9)  have  the  following  interpretation. 

Let  r‘  :=  min  {? (B')  \  B'  €  V(t),  pf  =  r }  with  respect  to  (2.7).  Then  cN{S)  (/*)  <  0  for 
all  t'  €  (r,r’)  in  each  such  pivot  step.  Therefore  z(t')  >  zW)  for  all  /'€  (r,r').  Thus  a  finite 
sequence  of  such  pivot  steps  yields  a  basis  B  € 

An  analogous  interpretation  can  be  given  assuming  (2.4)  instead  of  (2.6).  Then  f  in 
(2.9)  is  replaced  by  a  parameter  value  ts  defined  by 
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In  the  case  r  =  /3r8  there  exists  an  index  s  with  rs  >  r.  Then  (2.9)  holds  and  x^(s)  is  intro¬ 
duced  into  the  basis. 

We  summarize  the  above  proposed  steps  for  a  performance  of  (2.3)  in  the  following  algo¬ 
rithm.  Finiteness  and  validity  follow  from  the  above  remarks.  The  assumption  (2.6)  is  made 
but  if  only  (2.4)  holds  the  necessary  modifications  can  easily  be  found  from  (2.5)  and  (2.10). 

(2.11)  Solution  Method  For  (1.2) 

(7)  (INITIAL  SIMPLEX  STEP) 

Determine  an  initial  basis  B  €  F(a);  r,  :  =  r  :  —  a;  /c:  =  1. 

©  (LEAST  ZERO  CALCULATION) 

Determine  f  according  to  (2.7); 

if  Cj(t')  >  0  for  all  j  €  J(B)  and  a  parameter  r'  €  (r,f) 
then  go  to  (4)  . 

(3)  (PIVOTING  IN  F(r)) 

Choose  a  nonbasic  variable  x^[s)  with  cN(s)  (r)  =  0 
and  C/v(s)  (/')  <  0  for  a  parameter  /'  e  (r,f ); 
perform  a  pivot  step  introducing  xNis)  into  the  basis, 
redefine  x,  B,  /V;  go  to  ©  . 

(4)  (ITERATION) 

Bk :  -  B  ;  k  :  ■=  k  +  1 ; 

if  f  ^  B  then  tk  :  =  /3  ;  r  :  =  k  ;  stop. 

Otherwise  tk  :  =  r  :  =  f  ;  go  to  ©  . 

We  have  already  emphasized  the  numerical  difficulties  in  the  determination  of  f  (cf.  2.7). 
Furthermore  the  necessary  evaluations  of  the  nonlinear  functions  cN{t)  will  be  highly  time  con¬ 
suming  in  general  (cf.  section  4).  Therefore  simple  numerical  methods  using  only  a  few 
evaluations  should  be  preferred.  During  the  search  in  V{t)  the  exact  determination  of  f  can 
be  replaced  by  any  procedure  giving  an  answer  to  the  question  whether  the  current  basis  B  is 
optimal  in  a  small  interval  (t,t  +  e),  e  >  0  or  not.  Only  if  the  answer  is  positive  is  an  explicit 
determination  of  r  necessary. 

In  the  following  we  discuss  some  theoretical  properties  of  the  solutions  of  (1.2)  and  its 
dual.  Only  the  existence  of  a  finite  optimal  solution  of  (1.2)  is  assumed. 

We  define  the  optima!  value  Junction  2:1—  1R  by 

(2.12)  f(/):-  cg(t)  Ag'b 

for  t  €  /  and  B  €  V{t).  This  is  a  well  defined  function.  Due  to  the  existence  of  a  finite  solu¬ 
tion,  the  continuity  of  z  can  be  shown  in  the  same  way  as  in  the  case  of  linear  functions  c  (r ) . 

The  dual  of  (1.2)  is 

(2.13)  max  [yTb  |  y  €  S(f)) 
with 

(2.14)  S(f):»  (y  €  Rm  |yrA  <  c(r)r) 

for  t  €  /.  As  (1.2)  has  an  optimal  solution  for  all  t  €  /  so  has  (2.13).  A  finite  optimal  solution 
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of  (1.2)  yields  a  continuous  optimal  solution  y  :  /  —  Rm  of  (2.13)  defined  by 
y(t)T:  -  cBk  (t)TAgk 

for  t  €  [4,4+1],  k  =  1,2,  ....  r.  Continuity  in  4,  =  2, 3 . r  follows  from  (2.8.1).  The 

optimal  reduced  cost  coefficients  c  :  l  —  R  J  defined  by 

cj(t)  :  -  CjU)  -  y(t)T  A j  for  j  -  1,2,  ....  n 

are  continuous  functions  in  /. 

If  we  consider  the  more  general  problem  (1.4)  the  situation  is  only  a  little  bit  more  com¬ 
plicated.  This  is  due  to  the  assumptions  made  for  (1.4).  For  an  application  of  the  solution 
method  only  a  few  changes  are  necessary.  The  optimality  set  is  now 

(2.1) '  KB)  =  {t  €  /  |  cN(t)  >  0,  xBU)  >  0} 

with  the  reduced  cost  coefficients 

C;(r) :  =  c/t)  -  cB(t)T  A b  1  (/)  Aj(t) 

for  j  —  1,2 . n.  The  finite  optimal  solution  is  not  piecewise  constant  in  general  as  before 

but  is  defined  by 

xg(t)  :  =  Ag  1  ( r)b(t ) 

in  the  respective  intervals  of  /.  A  discussion  of  the  pivot  step  changes  in  (2.8)  shows  that  the 
dual  optimal  solution  P  and  the  optimal  reduced  cost  functions  c  may  be  discontinuous  at  those 
optimality  interval  endpoints  corresponding  to  the  zeros  of  certain  basic  variables  xB{r)(t). 
Then  the  primal  optimal  solution  is  continuous  at  such  points.  The  necessary  modifications  of 
(2.11)  in  the  definition  of  f  (resp.  pf  and  rs)  follow  easily  from  (2.1)'.  The  modified  method 
solves  problem  (1.4)  in  the  same  manner  as  before  (1.2). 

3.  TRANSPORTATION  PROBLEMS  AND  T-NORM  OBJECTIVES 

The  nonlinear  one-parametric  transportation  problem 

(3.1)  min  £  2- fyO)  •  x(/ 

.«  er  ,•  j 

with  a  continuous  function  c  :  /  — *  R"'"  subject  to 

(3.2)  T:  =  (x  |  £  x„  ==  a,,  £  x„  =  br  x„  >  0} 

I  I 

for  given  positive  real  numbers  a,,  i  *=  1,2 . m  and  bn  j  —  m  +  1,  m  +  2 . m  +  n  is  a 

special  case  of  (1.2).  W.l.o.g.  we  assume  =  £ br  Index  sets  for  i  resp.  j  are  given  expli¬ 
citly  only  if  proper  subsets  of  the  general  range  (1,2 . m)  resp.  [m  4-  1  ,m  +  2 . m+n) 

are  considered.  The  set  of  basic  resp.  nonbasic  variable  indices  is  denoted  by  B  resp.  N. 

The  primal  transportation  algorithm  (cf.  Murty  [2])  is  chosen  as  proper  modification  of 
the  simplex  method.  It  makes  full  use  of  the  special  structure  of  the  transportation  problem 
and  provides  a  direct  method  for  the  construction  of  the  nonbasic  reduced  cost  coefficient  func¬ 
tions  c„(r)  for  U,j)  €  N.  The  dual  variables  are  usually  denoted  by  w,(r),  v,(/).  With  respect 
to  a  given  basis  B  they  are  recursively  defined  by 

u,{t)  +  v,(r)  »  c„(r) 

for  all  (i.j)  €  B  starting  with  //,(/)  =  0.  The  tree  structure  of  B  makes  it  easy  to  find  the  gen¬ 
eral  form 
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At)  -  £  8 ,j  ■  C,j(t) 

B 

with  8 jj  €  (0,  ±  1)  for  all  Uj)  €  B.  Then  the  nonbasic  reduced  cost  coefficient  functions  are 
given  by 

CijU)  -  c0U )  -  Uj(t)  -  \j(t) 

for  all  Uj)  €  N.  Therefore  these  functions  are  of  the  general  form 

(3.3)  c^U)  -  cM„(r)  -  £  y.j  ■  CjjU) 

B 

with  (fi.v)  €  JVand  yy  €  {0,  ±1)  for  all  U.j)  €  B.  Coefficients  with  value  ±  2  do  not  occur  as 
‘row  nodes’  n  have  even  distance  and  ‘column  nodes’  v  have  odd  distance  from  the  ‘root  node’ 
1  with  respect  to  the  tree  B.  For  the  same  reason  yu  -  0  for  all  Uj)  on  the  common  ‘path’ 
from  Toot  node’  1  to  the  last  common  ‘predecessor’  of  n  and  v.  This  information  is  helpful 
for  the  calculation  of  the  reduced  cost  coefficients  from  the  stored  functions  Cy(t).  Only  the 
storage  of  the  iterated  vectors  y  of  length  |fl|  is  necessary.  The  storage  required  for  the  c,7(r) 
is  decreased  if  the  objective  function  has  the  form 

(3.4)  £  £  +  £  g,  ( d,).t )  x„ 

i  i  i- 1 

with  certain  functions  g,  and  constants  djh  1  =  0,1 . k.  Weinert  [9]  and  Carpentier  [1] 

consider  objective  functions  of  the  form 

(3.5)  ££  dfj  +  £  h,U)  ■  4  x„ 

i  i  /-I 

with  certain  functions  h,  and  constants  /  =  0. 1 . k.  In  (3.5)  the  coefficient  functions 

depend  only  linearly  on  d\r  This  was  necessary  in  their  approach  to  define  the  rows  of  the 
extended  simplex  tableau  (cf.  (1.3)).  The  number  of  stored  functions  is  only  k  resp.  k  instead 
of  mn.  Furthermore  k  <  k. 

A  special  case  of  (3.4)  with  k  =  1  is 

(3.6)  z(x.t) :  ££  (cy)'  •  x„ 

i  i 

with  positive  real  c,7.  For  fixed  t  €  [l,°o)  the  minimization  of  (3.6)  subject  to  (3.2)  is 
equivalent  to  the  t-norm  transportation  problem 

1!/' 

(3.7)  z(/):=  min  ££  (Cy)'x,; 

*(T  t  j 

Such  problems  are  examples  for  algebraic  transportation  problems  (cf.  [11]).  For  t  =  1  (3.7)  is 
the  classical  transportation  problem.  If  we  denote  the  objective  function  of  (3.7)  bj  z(x,r)  then 
it  is  easy  to  see  that 

lim  z(x,t)  —  d(x):  =  max  {cy  |  Xy  >  0) 

f — »oo 

for  all  x  €  T.  This  shows  the  close  relationship  of  (3.7)  with  large  values  of  the  parameter  and 
the  bottleneck  transportation  problem 

(3.8)  d:  —  min  d(x). 

x€  r 

Further  on  we  consider  the  time-cost  transportation  problem 
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d 

d(x) 

:  -  lex  min 

8 

x  €  T 

L  *u 

Cn  -  d(x) 

that  is  a  minimization  with  respect  to  the  lexicographical  ordering  of  vectors.  Among  the 
optimal  solutions  of  (3.8)  we  determine  those  with  minimum  sum  of  the  basic  variables 
corresponding  to  the  bottleneck.  In  order  to  exclude  trivial  cases  we  assume  the  existence  of 
c„  <  d  and  d  <  ck/. 

Then  let 

d+ :  -  min  (cy  |  c0  >  d) 
d- :  -  max  {cy  I  c0  <  d }. 


We  assume  the  existence  of  a  second-best  solution  of  (3.9)  with  value  8'  >  8  in  the 
second  component. ^Let  e  denote  the  minimim  value  of  a  nonvanishing  basic  variable  and  let 
V(t)  resp.  V  resp.  V  denote  the  set  of  optimal  basic  solutions  for  (3.7)  resp.  (3.8)  resp.  (3.9). 
Using  an  idea  of  Steinberg  (8]  we  find  the  following  relationships. 

(3.10)  PROPOSITION 

Let  -  [ln(I  a,)  -  In  «]  /  [In  d+  -  In  d], 

0  :  -  find  a,  -  8)  -  ln(8'  -  6)]/  [In  d  -  In  dj. 

Then 

(3.10.1)  />/§-*►  VU)  Q  V, 

t  >  max  (0,0)  -*■  V(t)  £  V. 

PROOF:  Let  x  €  V  and  x  ^  V.  Then  t  >  0  implies 

z(x,t)  <  (I  a,)  d'_<  €  •  d'+  <  z(x,t)._  _ 

Let  x€  Pand  x€  V,  V.  Then  t  >  max  (J3,) 3)  implies 
z(x,t)  <  8  •  d'  +  (I  a,  -  8)  d'_  <8 '■</'<  z(x,t). 

□ 


There  are  examples  for  which  the  bounds  in  (3.10)_are  sharp^  Furthermore  examples  are 
known  for  which  p  <  0  and  V C\V  (/)  —  <!>  for  all  t  with  0  <  t  <  0. 

For  integer  problems  the  bounds  0  and  0  can  be  simplified  using  e,  8' —  8  >  1  and 
</+  >  d  +  1,  d.  <  d  -  1.  (3.10)  shows  that  with  increasing  parameters  an  optimal  solution  of 
(3.7)  is  necessarily  an  optimal  solution  of  (3.8)  and  finally  an  optimal  solution  of  (3.9).  Due  to 
these  relations  a  parametric  study  of  (3.7)  or  equivalently  of  (3.6)  can  be  interpreted  as  discus¬ 
sion  of  various  solutions  ’between’  the  classical,  the  bottleneck,  and  the  time-cost  transporta¬ 
tion  problem.  A  detailed  analysis  can  be  given  if  (3.6)  has  a  finite  solution  at  least  on  the 
interval  (1,  max  (0,0)]. 

With  slightly  modified  assumptions  (3.10)  and  the  discussion  of  /-norm  objectives  in  com¬ 
parison  with  bottleneck -resp.  time-cost  objectives  hold  for  the  general  problem  (1.2).  Further¬ 
more  a  generalization  for  closed,  bounded  subsets  P  of  R+  holds  under  similar  assumptions. 
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Existence  of  finite  solutions  in  the  interval  [1,  max  (/3,j8)]  can  be  derived  from  the  gen¬ 
eral  form  of  the  nonbasic  reduced  cost  coefficients  (3.3)  for  the  unbounded  interval  (1,°°).  In 
the  special  case  of  (3.6)  these  functions  have  the  form  fit)  =  0  or 

fo)  - 1  (s,)'-  i  (pj‘ 

S-l  0,-1 

with  mutually  distinct  s8,  pw  >  0  for  8  =  1,2 . k  and  o>  =  1,2 . /.  If  fit)  ^  0  then 

fit)  has  at  most  k  +  /  -  1  zeros  in  R  (cf,  Polya  and  Szego  [4]).  Therefore  (2.6)  holds  with 
respect  to  A  bound  for  these  zeros  is  (cf.  Wiistefeld  [10]) 

(3.11)  r  ^  ln(max(X,/))  /  [In  A  -  In  X] 

with  X  :  =  min  {max  s8,  max  pw )  <  X  :  —  max  {max  s8,  max  pu\.  Such  bounds  are  very  useful 
in  the  calculation  of  the  least  zero  f  in  step  (2)  of  (2.1 1). 

The  optimal  value  function  zof  (3.7)  has  the  following  properties  (cf.  Wiistefeld  [10]). 

(3.12)  PROPOSITION 

Lel  J7'  ,€  R’  12 . *  and  b,  6  IN,  j  =  m  +  1,  m  +  2 . m  +  n.  Then  z: 

ll.oo)  —  R+  is  a  continuous,  monotonically  decreasing  and  piecewise  differentiable  function. 

PROOF:  The  optimal  solution  function  fulfills 

(3-13)  z(r)  -  [cBj/(r)rxflJl/' 

for  t  €  [f*,r*  +  |],  k  -  1,2 . r  with  respect  to  a  finite  optimal  solution  of  (3.6).  Thus  z  is 

continuous  and  piecewise  differentiable.  Explicit  differentiation  in  t  6  Uk,tk+\)  for  k  6 
{1,2 . r )  shows  that  the  derivative  is  nonpositive  iff 

£  (Cij)'  ■  x,j  ■  [In  c(J  -  In  z(r)]  <  0 

s* 

with  U,y):=  xBk.  Due  to  the  integrality  we  know  xtj  ^  1  if  x,y  ^  0.  Then  In  c,y  <  In  z(r). 
Together  with  (3.12)  this  yields  the  monotonicity  of  z. 

Obviously  z(r)  is  monotonically  decreasing  with  limit  d.  For  sufficiently  large  t  (cf  3.11) 
we  find  a  constant  optimal  solution  for  (3.7)  which  is  optimal  for  (3.8)  and  (3.9)  as  well. 

4.  COMPUTATIONAL  EXPERIENCE 

The  method  discussed  in  the  previous  sections  was  coded  for  r-norm  transportation  prob¬ 
lems  with  i  €  [l,/3],j8:  =  In  (Lfl/)/[ln  (d+ 1)  —  In  d\  (cf.  3.10),  in  FORTRAN  IV.  Tests  were 
run  on  the  CDC  CYBER  72/76  at  the  computer  center  of  the  University  of  Cologne. 

The  random  integer  coefficients  of  the  nxm  cost  matrices  were  drawn  from  a  uniform  dis¬ 
tribution  in  the  interval  (0,100)  resp.  from  a  (50,3)  Gauss  distribution.  The  integers  a,  and  bj 
were  chosen  in  (1,10)  with  I  a,  =  E  bj.  Twenty-five  examples  were  solved  for  each  choice  of 
the  dimension  parameters  n,  m. 

No  significant  differences  were  found  between  quadratic  (n  =  m)  and  rectangular  prob¬ 
lems  ( n  ^  m).  Therefore  we  discuss  only  the  case  n  -  m.  In  comparison  with  transportation 
problems  without  parameters,  computing  times  were  raised  enormously.  Evaluation  of  the 
functions  cM1,  (r)  in  the  determination  of  zeros  (step  2  of  2.11)  consumed  about  70%  of  the 
computing  times. 
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O  Gauss 
•  uniform 


Problems  with  Gaussian  distributed  random  integer  cost  coefficients  show  to  be  more 
difficult.  This  is  essentially  due  to  less  efficient  bounds  (cf.  3.11)  in  this  case.  Then  more 
function  evaluations  are  used  during  the  performance  of  the  bisection  method  which  was  imple¬ 
mented  for  the  determination  of  zeros. 

The  mean  running  time  is  approximately  of  order  0(/j4).  This  coincides  with  the  product 
of  the  number  of  the  reduced  cost  coefficient  functions  (n—  l)2  and  the  approximate  order 
0(n2)  of  the  number  of  pivotings. 

TABLE  1  —  Mean  Number  of  Pivoting  Steps  for 
T-Norm  Transportation  Problems  in  fl,  fij 


n  =  m 

5 

10 

15 

20 

25 

uniform 

2. 

4.4 

8.8 

13.4 

16.2 

Gauss 

1.8 

5.2 

8.5 

13.6 

19.1 

Standard  deviations  of  both  considered  means  are  rather  high  in  all  cases  (30-50%  of  the 
mean  values).  Obviously  the  practical  solvability  of  parametric  problems  is  limited  by  running 
times.  Storage  problems  are  less  important.  In  the  proposed  method,  storage  is  essentially 
given  by  the  size  of  the  matrix  of  the  cost  coefficients.  Compared  with  the  simplex  method 
using  an  extended  tableau  (cf  [9])  this  is  an  obvious  advantage. 
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ABSTRACT 

This  paper  deals  wild  the  Weber  single-facility  location  problem  where  the 
demands  are  not  only  points  but  may  be  areas  as  well  It  provides  an  iterative 
procedure  for  solving  the  problem  with  lp  distances  when  />  >  1  (a  method  ol 
ohinming  the  exact  solution  when  /i  =  I  and  distances  are  thus  reclangular  al¬ 
ready  exists)  The  special  case  where  the  weight  densities  in  the  areas  are  uni¬ 
form  and  the  areas  are  rectangles  or  circles  results  in  a  modilied  iterative  pro¬ 
cess  that  is  computationally  much  faster.  This  method  can  be  extended  to  the 
simultaneous  location  of  several  facilities. 

INTRODUCTION 

In  the  ordinary  Weber  problem,  we  must  locale  a  facility  so  that  the  sum  of  weighted  dis¬ 
tances  from  the  facility  to  n  demand  points  is  minimized.  There  are  cases  where  the  demand 
may  be  better  considered  spread  over  an  area  or  areas  than  concentrated  at  mathematical  points 
on  the  plane.  For  example,  one  could  be  dealing  with  a  very  large  number  of  demand  points  as 
in  an  urban  area.  Then  too,  a  "distributed"  demand  could  be  used  to  represent  random 
occurrences  of  demand  from  within  areas. 

When  distances  are  rectangular  (/))  and  when  the  areas  are  themselves  rectangles  with 
sides  parallel  or  perpendicular  to  the  axes  of  measurement,  the  problem  can  be  solved  exactly 
(Wesolowsky  and  Love  (7]).  The  problem  becomes  more  difficult  with  Euclidean  U2)  dis¬ 
tances.  Love  [4]  expressed  the  objective  function  in  analytic  terms  and  suggested  a  nonlinear 
optimization  procedure.  Bennett  and  Mirakhor  [2]  replaced  the  areas  with  their  centers  of  grav¬ 
ity  as  an  approximation. 

Our  approach  is  essentially  based  on  the  Weiszfeld  [6]  procedure;  that  is,  on  an  iterative 
solution  of  the  extremal  equations.  In  the  special  case  when  areas  are  rectangles  (as  assumed 
by  [21,  [4],  and  [7])  or  circles,  and  demand  is  uniformly  distributed,  we  propose  a  "special" 
iterative  procedure. 

Our  method  is  more  general  than  the  methods  in  [21  and  [4]  because  ln  distances  and  not 
only  Euclidean  distances  are  permitted.  Also,  circles  and  general  shapes  are  treated  in  addition 
to  rectangles.  Further,  our  method  appears  considerably  faster  than  Love's  method  and  gives 
more  accurate  results  than  Bennett  and  Mirakhor’s  approach. 


•This  research  was  supported  by  the  Natural  Sciences  and  Engineering  Research  Council  Canada. 
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THE  PROBLEM 

Consider  the  distribution  of  weight  density  over  a  plane.  The  weight  density  could  be 
interpreted  as  the  cost  per  unit  distance  per  unit  area  (if  the  area  is  infinitesimally  small)  of 
delivering  to  the  demand  in  that  area.  We  wish  to  find  the  location  of  a  facility  (x0,.v0)  that 
minimizes  the  total  cost  of  delivery.  We  must  therefore  minimize 

(1)  F(x0,y0)  =  f  f  w(x,y)  [|x  -  x0|'’  +  \y  -  y0\n]Un  dxdy, 

where  >v(x,.y)  is  the  weight  density,  h(x,v)  >  0,  and  [|x  -  x0|',+  |>’  -  yo\r‘\Vp.  p  ^  1,  is  the 
lr  distance  of  point  (x,>)  from  (x0,y0).  Note  that  for  p  =  1,  distances  are  rectangular  and  that 
for  p  —  2  they  are  Euclidean. 

We  will  consider  only  p  >  1  because  a  simple  procedure  for  solving  the  problem  when 
p  =  1  already  exists  [7]. 

It  is  safe  to  assume  that  in  practice  we  can  find  a  finite  circle  large  enough  so  that  {x,y) 
is  zero  outside  it.  It  will  be  helpful  to  consider  the  following  discrete  approximation  to  Eq.  (1); 
we  use  it  only  to  make  certain  provable  conclusions  about  Eq.  (1)  intuitive. 

(2)  /UoJ'o)  =  £  •Ml*/  ~  A-ol"  +  \y,  ~  ya\pY'n  dxdv. 

i- 1 


The  function  f(x0,y0 )  is  simply  the  objective  function  of  the  ordinary  Weber  problem 
with  n  demand  points.  It  is  clear  that  we  can  choose  the  weights  w,  and  the  number  n  to  make 
f(x0.y0)  approximate  F(x0..y0)  as  closely  as  desired.  We  could  thus  think  of  F(x0,y0)  as  the 
limit  of  f(x0,y0)  as  n  —  «>. 


Since  /(Xn.y'o)  is  known  to  be  convex  (it  is  a  nonnegative  linear  combination  of  convex 
distance  norms;  so  >s  F(x„,yn)  (this  could  be  shown  formally).  The  minimum  is  therefore 
defined  by  the  extremal  conditions 


dF 

dxn 


1L 

9>o 


=  0. 


Note  that  the  derivatives  of  f(x0,y0)  do  not  exist  at  (xqJ’o)  =  ( xj>y >)•  This  problem  does  not 
occur  with  F(x0,yo).  Differentiating  under  the  integral  we  have 


(3a) 


OXq  u  00  U  o° 


(x  -  x0)|x  -  x0| 


n- 2 


[  I X  -  X0|''  + 


y  nl") 


\-\/p 


dxdy  ~  0. 


(3b) 


BF 

d.V’n 


/oo  /*  °° 

I  w(x.v) 

-  oo  •/  —  oo 


(i 


i’n>  l.r  -  : 


|x  -  x,,!'’  +  l.t 


| /’ll  -Up 


dxdv  =  0. 


These  equations  can  be  used  to  construct  an  iterative  procedure  that  is  similar  to  the 
Weiszfeld  procedure  [6]  and  was  suggested  in  (5).  In  effect,  we  ’’solve”  Eqs.  (3a)  and  (3b)  and 
use  the  result  iteratively.  For  example,  from  Eq.  (3a)  we  obtain 


AO 


(*+i)  _ 


r; 

— oo  •/  — oo 


w(x,y) x \x  -  x^*1  \p  2 
x  -  xkk)  |"+  \y  -  y( !AI  YV-dp 


dxdy 


w(x.y)|x  -  x&k'  I'1-2 

xr\p+  iv  -  yr  w-',p 


dxdv 


(4) 
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where  k  stands  for  the  A.  th  iteration.  A  similar  expression  can  be  obtained  for  This 

iterative  procedure  (for  the  discrete  case  /(x0,y0)  and  thus  for  F(x(l,y(l) )  is  known  to  converge 
for  p  —  2.  It  has  not  been  proven  to  converge  for  p  ^  2,  but  to  our  knowledge  has  always  con¬ 
verged  in  practice  (5).  When  it  does  converge,  however,  the  optimality  conditions  are  satisfied 
and  the  optimum  has  been  reached. 


As  an  illustration,  let  w(x.y)  =  V?+  y2  only  inside  the  unit  circle  around  the  origin.  If 
p  —  2,  Eq.  (4)  becomes 


a  A, 


Vx2  +  y2 


*o 


(*+n 


-*1  y/(x  -  xkk)  )2  +  (y  -  yok)  )2 


dydx 


r 1 

J-i  J-VTT 


yj(x  -  x0u)  )2  +  (y  -  )2 


dydx 


Iterations  will  produce  convergence  to  the  origin.  For  example,  when  (x,J*  \y0u  ')  is  near  the 
origin  it  can  be  shown  that  x,,/*'n  ~  .Vo*1  * /2  and  yd‘‘  +  1'  =  ydu/2. 


The  numerical  computation  of  double  integrals  in  Eq.  (4)  could  consume  a  great  deal  of 
time.  We  suggest  a  more  efficient  two-stage  procedure  in  the  next  section. 


A  TWO-STAGE  ITERATIVE  PROCEDURE 


The  total  area  of  demand  is  usually  made  up  of  distinct  areas  where  w(.v,y)  =  0  between 
them.  In  any  case,  without  a  loss  of  generality,  we  can  break  up  the  total  area  into  q  subareas, 

A  i . Aq.  We  ignore  for  the  moment  the  possible  existence  of  demands  at  mathematical 

points  to  simplify  the  presentation;  such  points  can  be  easily  included  into  consideration  later. 

Under  these  assumptions,  Eqs.  (3a)  and  (3b)  become 

ff  (X  -  Xq)  \X  ~  xq\P' 2 

9*o  T!  a,  (I*  -  *0IP  +  It  -  Tol'I1'17'’  }’ 

(5a)  «  _£  FJx0.y0)  -  0 

0*0  ,_1 

dF  =  r  r  >v(x.y)  (y  -  To)  It  ~  , 

9to  ,T1  y  [I*  -  *0IP  +  ly  -  yo\rV  ^p  > 

(5b)  =  Fyi(x0.y0)  =  0. 

oTo  i-i 

The  central  idea  behind  the  proposed  iterative  procedure  is  that  given  a  location  (x0,y0)  of 
the  facility,  the  cost  of  area  i  can  be  replaced  with  the  cost  of  an  equivalent  demand  point 
whose  location  and  weight  can  be  computed.  In  effect,  we  will  then  be  using  a  Weiszfeld  pro¬ 
cedure  on  q  point  destinations. 

Consider  (x.y),  which  can  be  any  point  on  the  plane  and  any  area  A,,  such  that  Fr,(x,i ) 
*  0  and  F,,(x,y)  ^  0.  We  shall  find  (x,,y,)  and  h,',  the  location  and  weight  of  a  point  which 
will  give  derivative  components  equal  to  F„(x, f)  and  F,.,(x.y).  In  effect,  we  are  replacing  the 
area  A,  with  an  "equivalent"  point;  note,  however,  that  this  equivalence  is  dependent  on  the 
location  (x.y)  and  is  not  necessarily  true  elsewhere.  It  follows  that  at  optimality,  q  points 
(x,.y,)  with  weights  w'  can  replace  the  problem  with  areas.  Although  (x,.y,)  and  tv,  are  func¬ 
tions  of  (x.y),  we  do  not  specify  this  with  notation  to  avoid  unnecessary  notational  complexity. 
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Therefore,  by  Eqs.  (5a)  and  (5b), 


(6a) 

(6b) 


(x,  -  x)|x,  -  x)|'>  2 

w  — - - — 

'  [|x,  -x|'’  +  \y,  -yW" 

u..  Cv,  ~  J>>U  ~  >’lp~2 
[Uj-xK+l  y,-S’\p]l//’ 


Fjx.y) 

Fyi(x,y). 


Note  that  the  left-hand  sides  in  Eqs,  (6a)  and  (6b)  are  simply  the  derivatives  of 
w,  [|x,  -  x\n  +  |  v,  -  f  I'’]1'1'’  with  respect  to  x  and  y ,  respectively,  and  the  right-hand  sides  have 
the  corresponding  derivatives  for  the  term  A,. 


Dividing  Eq.  (6a)  by  Eq.  (6b),  we  get 

I x,  -  xi'1-2  fv,(x,r) 
U  -  v\"-2  ~  FJx.y) 

Let 

s,(x,>)  =  1,  if  R,(x,y)  >  0, 

=  -1,  if  R,(x,v)  <  0. 


(7) 


x,  —  x 


v,  —  .1 


R,(x,y). 


Therefore,  Eq.  (7)  implies 

(8)  y:  -  (x,  -  x)/(s,(xi)|/?,(xJ)|,/('’-")  +  .P 

which  is  the  equation  of  a  straight  line  through  (xj));  in  other  words,  (x,,j,)  is  not  uniquely 
determined.  Some  comments  on  technical  points  should  be  made.  When  (x,y )  is  such  that 
Fxi  =  0  and  Fy,  &  0,  then  x,  =  x,  and  y,  is  undetermined.  If  Fy,  =  0,  but  FXI  &  0,  then  y,  -  y 
and  x,  is  undetermined.  If  both  F„  =  0  and  Fy,  =  0,  then  (x,,y,)  =  (xJO.  Further  (barring 
Fx,  =  0  and  Fyi  =  0),  when  x,  =  x  and  y,  =  y,  then  Eq.  (8)  still  gives  the  equation  of  the  line  in 
spite  of  the  fact  that  the  left-hand  side  of  Eq.  (7)  is  undefined.  The  above  comments  could  be 
proved  analytically,  but  it  is  easier  to  justify  them  (and,  in  fact,  the  rather  surprising  result  of 
Eq.  (8)  as  well)  by  invoking  a  mechanical  analog  that  has  been  used  for  solving  the  Weber 
problem  with  point  demands  ([31,  p.  193).  First,  however,  we  complete  our  assignment  by 
finding  w/,  the  equivalent  weight  of  the  point. 


We  take  absolute  values  on  both  sides  of  Eqs.  (6a)  and  (6b).  raise  both  sides  to  the  power 
p/(  p  —  1),  and  sum  the  two  equations  to  obtain 


(w,y/'-x 


lx,  -  x|p  -i-  U  - 

llxj  -  x|p  +  U  - 


=  |i Fd'/(»’-1)  +  |/ylp/<'’-'). 


Therefore, 

(9) 


w/=  11 F 


We  now  turn  to  the  Varignon  frame,  a  mechanical  frame  analog.  Tor  an  explanation  of  the 
re1  us  Eqs  (8)  and  (9)  The  frame  is  constructed  by  drilling  holes  in  a  board  corresponding  to 
the  coordinates  of  demand  points.  Weights  corresponding  to  weights  u  are  hung  from  strings 
which  are  passed  through  the  board  from  below  and  lied  together  in  a  knot  above  It  is  easily 
demonstrated  that  the  knot  is  in  equilibrium  at  the  best  location  for  the  facility  when  p  =  2. 
However,  consider  the  knot  at  some  point  (x.f)  other  than  at  equilibrium.  Note  first  that  the 
force  exerted  on  the  knot  by  any  weight  depends  on  the  magnitude  of  the  weight  and  the  angle 
of  the  string,  not  on  the  distance  of  the  weight  from  the  knot.  This  "explains"  the  line  relation¬ 
ship  <8)  and  the  indeterminancy  of  the  exact  position  of  iv  .  Further,  we  could  imagine  the 
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areas  as  densely  filled  with  holes  and  having  very  many  strings  leading  to  the  knot.  The  condi¬ 
tion  Fjx.y)  —  0,  Fv,(x,y)  =  0  is  equivalent  to  the  knot  being  in  such  a  position  that  the  forces 
of  the  area  strings  balance;  this  explains  why  w’  is  zero  here  (as  can  be  checked  by  Eq.  (91). 
Also,  when,  for  example,  only  Fxi(x,y)  =  0,  this  means  that  there  is  no  ^-direction  component 
exerted  by  the  strings,  and  Eq.  (8)  is  replaced  by  a  vertical  line  through  x. 

The  iterative  procedure  we  now  suggest  is  as  follows.  We  start  with  some  initial  point 
(x°,y°).  We  then  convert  the  areas  A,  into  points  with  weights  w,'  (if  A,  is  a  point,  tv,'  —  w,). 
Since  the  exact  location  of  a  point  on  the  line  (8)  is  indeterminate,  we  choose  that  point  that  is 
closest  to  the  center  point  of  the  rectangle  bounding  A,.  We  now  use  the  Weiszfeld  iteration 
for  points  [5]: 


tV,’  Xi\Xj  -  x{k)\p  2 


(10a) 


>(*+n  _ 


Q 

5  Ik  -  +  k,  -  ><*)IPI1~I/" 


tv/k  - 


[k,-*<*)l/’  +  \y,~  y{k)\”Y~'lp 


The  equation  for  f(A+n  is  analogous  and  will  be  referred  to  as  Eq.  (10b).  This  procedure 
could  be  modified  to  a  hyperbolic  form  as  in  [5].  The  new  point  is  used  to  recalculate 
equivalent  weights,  then  xu  +  n  and  fu  +  n  are  recalculated,  and  so  on. 


We  do  not  have  a  proof  that  the  above  procedure  converges.  However,  we  have  found  no 
cases  where  Eq.  (10)  did  not  converge  very  quickly.  Also,  note  that  when  Eq.  (10)  does  con¬ 
verge,  the  optimum  has  been  found  because  when  the  left-hand  side  of  Eq.  (10)  is  equal  to  the 
right-hand  side,  the  extremal  equations  are  met.  The  advantage  of  Eq.  (10)  over  Eq.  (4)  is  that 
for  certain  special  cases,  we  can  find  relatively  simple  forms  for  the  weight  w'  and  for  Eq.  (8). 

The  method  can  be  extended  to  a  problem  as  in  [7]  where  several  facilities  (with  inter¬ 
shipments)  are  to  be  located.  The  Weiszfeld  procedure  can  be  adapted  to  the  multifacility  prob¬ 
lem  along  the  lines  described  on  p.  230  of  [31.  Since  only  the  demands  are  areas,  Eq.  (9)  could 
be  used  in  the  same  way.  In  the  next  section,  we  will  deal  with  the  problem  under  some  sim¬ 
plifying  assumptions  which,  incidentiallv,  have  also  been  made  by  previous  papers  on  the  sub¬ 
ject. 


SPECIAL  ASSUMPTIONS 


We  first  assume  that  the  weight  density  in  each  area  A,  is  uniform  and  we  denote  it  by 
w,".  We  now  have 


(11) 


fjx.y)  =  w,°J  J 


Or  -  jr)lx  ~  x^-1 
\x  -  x|',+  \y  -  y\py-Vr> 


dxdy. 


If  we  define  A,  by  letting  y  be  in  the  region  [<f>h(x),  <6 2 , ( -x- )  1  for  a,  <  x  <  b,  and  by  letting  x 
be  in  the  region  [*/»  1 , ,  (.v ) .  </<2,(.v)]  for  c,  <  y  ^  d,. 


FJx, f)  =  w,°  J  Ik  -  Jfl"  +  It  -  <ty 


and  therefore 

(12)  FJx.y)  =  w,°  f'  |[|*2,tv)  -  jf  +  \y  -  Pll'^  -  [ 1 , , (>• )  -  jf  I"  +  \y  - 


Similarly, 

(13)  FJx.y)  =  a-°  {Ik  |d.j,(A-)  -  v|',]|/'’-  [|.v  -.vK+  |<*,,(.v)  -  y \ r] 1  ,n]dx. 
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The  above  one-dimensional  definite  integrals  can  be  evaluated  by  numerical  integration. 
Note  that  Eqs.  (12)  and  (13)  may  not  be  usable  in  that  form  if  A,  is  not  a  convex  set;  however, 
integration  could  then  be  done  in  "pieces". 

We  now  look  at  two  special  shapes  of  areas  when  distances  are  Euclidean  (/»  —  2):  circles 
and  rectangles.  First,  we  consider  finding  the  equivalent  direction  of  a  circular  area.  It  could 
be  shown  that  this  line  of  force  always  passes  through  the  center  of  the  circle.  This  is  obvious 
by  consulting  the  mechanical  analog.  The  strings  emanating  from  the  much  drilled  circular  area 
must  pull  on  the  knot  at  Oc.y)  in  such  a  way  that  the  resultant  force  passes  through  the  center 
(recall  that  equal  weight  density  is  assumed) .  To  compute  the  weight  tv,’,  we  consider  the  point 
(x,y)  —  (0,0)  and  a  circle  of  radius  R  centered  at  (a, 0).  It  follows  that  Fyi  =  0.  Note  that 
from  symmetry  considerations  the  weight  obtained  is  the  same  as  the  weight  of  any  circle  of 
radius  R  with  its  center " a "  units  from  any  (x,y ) . 


By  Eqs.  (9)  and  (12)  and  assuming  p  =  2, 

K  =  |Fj 

=  w,°  J  ^ ([( a//?2  -  v 2  +  a)2  +  >,2]l/2  —  [(\/ R2  —  y2  —  a)2  +  y2V/2)dy 
=  w,0/?2  J  t  {[(\/l  -  y2  +  a/R )2  +  y2]1/2  -  [(Vl  -  y2  -  a/R )2  +  y2],/2)  dy 
(14)  w/  =  *v,°  irR2f(a/R) 

where 

/(«)  =  —  J"  U(>/l  -  v2  +  a)2  +  y2)1/2  -  l(Vl  -  y2  -  «)2  +  y2Yn)dy. 

The  evaluation  of  /(«)  can  be  done  accurately  by  the  evaluation  of  truncated  series  as 
described  in  the  appendix. 


We  now  turn  to  the  rectangle  shape  that  was  also  treated  in  [2],  [4],  and  [7],  In  Eqs.  (12) 
and  (13),  we  will  have  to  use 

<l>t,(y )  =  a,,  >l'2<(y)  =  b, 

ft> |, (or)  =  c,.  <t>2, (a)  =  (I, 


giving 


Fu(x,y)  =  tv,  j  ([(ft,  -  v)2  +  (>•  -  >)2]l/2  -  [(o,  -  x)2  +  (>•  -  y2)2]12)  dy 
and  a  similar  expression  for  F,  (,v,f). 


These  expressions  can  be  evaluated  to  give 


(15a) 


(15b) 


( .V  —  ft  )  |  .v  —  ft  | 
—  (.v  —  a  )  |.v 


/ 

f  -  d, 

_  f 

f  —  c. 

x  —  b, 

,/ 

x  —  b 

o  I 


,v  -  a, 


-A 


i 


Fr,  =  tv," 


(>■  -  </,)!>■  -  </, 

--  ( \  —  t )  ( r  —  v, 


A  -  d 


A  -  d 
v  —  a 


v  -  d 


./ 

v  —  h 

-  f 

\  —  c 

v  -  a 

v  —  a: 
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where 

/(a)  —  (a  V 1  +  a2  +  In  (a  +  V 1  +  a2)]. 

COMPUTATIONAL  RESULTS  AND  CONCLUSIONS 

Test  problems  with  points  (for  comparison)  and  areas  in  the  shape  circles,  rectangles,  and 
ellipses  were  run.  The  points  and  centers  of  areas  were  randomly  selected  from  within  a  10  x 
10  area.  Table  1  gives  the  computational  results.  The  procedure  was  stopped  when  the  /2  dis¬ 
tance  between  two  successive  iterations  was  less  than  10~4.  The  considerably  longer  time  taken 
for  ellipses  was  due  not  only  to  numerical  integration,  but  because  p  was  1.78  and  computing 
quantities  to  a  power  other  than  2  takes  longer. 

TABLE  1  —  Computational  Results 


No.  Demands 

Type  of  Demand 

No.  Iterations 

CDC  6400  s 

Distance 

500 

points 

11 

0.86 

h 

500 

rectangles 

6 

6.23 

h 

500 

circles 

6 

1.02 

h 

50 

ellipses 

11 

41.9 

1 1.78 

The  unconstrained  example  given  by  Love  in  [4]  was  solved  in  0.1  s  as  opposed  to  the  21 
s  on  the  Burroughs  B-5500  reported  by  Love;  the  same  solution  was  obtained  but  the  conver¬ 
gence  criteria  were  on  different  principles.  As  a  very  rough  guess  the  CDC  6400  is  5  or  6  times 
faster. 


It  should  be  noted  that  the  approximation  suggested  by  Bennett  and  Mirakhor  [2], 
namely,  replacing  areas  by  their  centroids,  could  give  widely  different  results  in  some  examples. 
For  example,  consider  the  two  rectangular  areas  in  Table  2.  The  optimum  point  is  (1.32337,0) 
with  a  cost  of  41.15065.  If  we  replace  the  areas  with  centroids,  the  solution  is  (5.5,0)  with  a 
cost  of  58.581 14. 


TABLE  2  —  A  Two-Area  Example 


i 

a, 

b, 

c, 

d, 

T 

8 

0 

1 

-0.5 

0.5 

2 

0.9 

1 

10 

-0.5 

0.5 
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Appendix 

EVALUATION  OF  f(a) 


As  given  in  Eq.  (14), 


/(a)  -  ~  y2  +  af  +  y2\ 1  ~  [(Vl  ~y2  -  «|  +  y7 ] 


dy- 


This  is  an  elliptic  integral  (see  [1])  with  the  following  properties: 
/(0)  =  0,  /(oo)  -  1, 

/( 1)  =  8/3rr,  /'(a)  >  0  for  a  >  0, 

/(a)  =  a/(  1/a). 

We  can  compute  /(a)  by  numerical  integration. 


In  standard  tables,  we  can  find  the  series  formulation 

w*+i 


/(a)  = 


Vl  +  a2 


y  2a 
*-o  1+a2 


(4  k)\ 


26k(2k)Wc'.)Hk  +  1)  ‘ 


Since  convergence  is  slow  for  a  =  1 ,  we  can  obtain  an  accelerated  formula  by  defining 
/3  =  2a/  (1  +  a2), 

ak  -  W2/2irk  -  (4A ) !/ (26*(2A ) ! (A: !)2)]/ (A  +  1), 

/(a)  =  VTTP/2J/3  4-  (V2/2 n)  (3  +  ((1  -  /32)//3)Ml  -  /32)J  -  £ 

The  function  /(a)  can  be  evaluated  approximately  as  follows: 

(a)  fora  «  [0,1], 

(i)  0  ^  a  ^  0.5, 

to  a  maximum  error  of  3  x  10-5; 

(ii)  for  0.5  <  a  <  1, 

/(a)  =  2 a  4—2-  +  a9  (0.0073  +  0.0621  (a  -  0.873)2), 

o  —  a 

to  a  maximum  error  of  2  x  10" 5 


(b)  for  a  >  1, 

use  /(a)  ■=  a/(l/a). 
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ABSTRACT 

The  waiting  time  in  the  random  order  service  G/M/m  queue  is  studied. 

For  the  Laplace  transform  we  obtain  a  simpler  representation  that,  previously 
available.  For  the  moments,  an  explicit  recursive  algorithm  ;s  given  and  car¬ 
ried  out  numerically  for  some  cases.  This  gives  rise  to  the  conjecture  that  the 
waiting-time  distribution  can  be  approximated  by  the  one  for  M/ M/m  after  a 
suitable  change  of  scale. 

1.  INTRODUCTION 

Consider  the  random  order  service  G/M/m  queue  with  interarrival  time  distribution  func¬ 
tion  F(F( 0)  =  0)  and  service' time  density  fie'11'.  Let  G  be  the  distribution  function  of  the 
waiting  time  W  (in  the  stationary  case)  conditional  on  its  being  positive,  i.e., 
i  |  ^>0).  We  shall  study  G  through  the  Laplace  transform 

1G(t)  and  the  moments  =  J*Q  t"dG(t)  of  G. 

The  model  has  been  investigated  by  LeGail  [3]  and  Takacs  [6],  who  give  the  characteristic 
function  and  Laplace  transform,  respectively,  of  W.  Carter  and  Cooper  Ill  and  Cooper  [2] 
study  G  directly  and  give  recursive  algorithms  for  its  computation.  Carter  and  Cooper  [1]  men¬ 
tion  that  their  analysis  was  motivated  by  a  study  of  the  Bell  System’s  No.  101  Electronic 
Switching  System. 

By  a  substitution  of  function  in  Takacs’  basic  differential  Eq.  (27)  we  are  here  able  to 
obtain  a  simpler  closed  expression  for  the  Laplace  transform  than  that  following  from  Takacs’ 
Eq.  (23).  Also  we  give  a  simpler  recursive  algorithm  for  the  moments  than  the  one  indicated 
by  Takacs  in  his  Eq.  (32)-(36).  The  algorithm  is  carried  out  numerically  for  some  special  cases. 
The  study  of  moments  gives  rise  to  some  conjectures  on  approximations  for  G. 

The  G/M/m/N  model  has  been  studied  by  Rosenlund  ([5]  §  10).  By  relations  (20), 
124).  (27),  (28)  and  (29)  the  Laplace  transform  of  IT  can  for  Af<«»  be  calculated  without  need 
for  numerical  integration  in  the  usual  D/ M/m  and  EKI M/m  cases,  so  that  it  might  be  prefer¬ 
able  to  approximate  the  present  infinite  waiting  room  with  a  finite  waiting  room.  Our  notation 

is 


GO)  -  P(W  < 

C(S)=  TV*: 


(3  -  mM, 

F( s)  -  Ce-"dF(t)  , 

J  o 

ilds)  =  F(J3s), 
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P  -  <  i  dF(.t))  1  (assumed  <  1), 

a(s)  -  root  2  with  smallest  absolute  value  of  the  equation  z  -  i/i(s  +  1  -  z), 

tu  —  a(0), 

A  —  as  given  by  (11)  in  Takacs  (6]  (cited  by  Cooper  ([2],  p.  186)), 

W,  -  waiting  time  of  /  th  customer, 

ys(x)  -  £  £(#--‘#  W‘\i  th  customer  finds  m  +  k  other  customers  at  arrival  x* , 

Mn(x)  -  £  £((£  +  k  customers  before  /  th  arrival)  x*, 

*-0 

W  -  random  variable  with  the  limiting  distribution  for  W,  as  /  —  00 • 

The  notation  is  adapted  to  obtain  functions  which  are  invariant  under  changes  of  time  scale. 

Let  Pk  be  an  arriving  customer’s  distribution  for  the  number  of  other  customers  in  the 
system  in  the  stationary  (long-run)  case.  From  Takacs  16]  Eq.  (9),  we  quote 

Pk  -  Atok~m,k  >  m. 


Hence  we  can  derive  the  relations 

(1)  P(W  <  t)  -  1  —  4(1  -  &>)_1  +  A  (1  -  <■>)"' G(r), 

(2)  G(s)  -  (1  -  <o)  Fs/@(a»), 

(3)  E(e~,w)  -  1  -  A  (1  -  a*)'1  +  A  (1  -  o>)''G(s), 

(4)  -  (1  -  w)f}~nM„(w) , 

(5)  f (H"0  -  >4(1 


For  m  “  1,  it  holds  that  A  “  u>(l  —  <v). 

2.  THE  LAPLACE  TRANSFORM 


From  Takacs  16]  Eq.  (27),  we  get 

(x  -*(s  +  1  -  x))  K'j(x)  +  Fs(x)  - 


(6) 


(1  -  *(s  +  1  -  x))(l  -  x)  !(s  +  1  -  x)  0  <  x  <  1. 


The  relation  between  Takacs’  notation  and  ours  is  £(s)  -  a(s)  =  y<J3s)  and 
yt(x)  -  <&03s,x).  Equation  (6)  also  follows  from  Eq.  (24)  in  Rosenlund  15],  which  was 
derived  by  different  methods  and  is  in  a  different  form  than  Eq.  (26)  in  Takacs  (6).  Before  its 
solution  we  make  the  substitution 

U,(x)  -  Fj(x)  -  (s  +  1  -  x)-’. 
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Then  from  Eq.  (6) 

U'  s(x)  +  i/s(x)/(x  -  i li(s  +  1  -  x))  -  s(l  -  x)_l(s  +  1  -  x)~2. 

Now  take  s  >  0  real  and  let  /  stand  for  either  [0,a(s))  or  (a(s),  1).  With  z  an  arbitrary  fixed 
point  in  1  we  have  for  x  in  1 


d_ 

dx 


Us(x)  exp 
exp 


dt 

*li(s  +  1  —  /)  —  t 
dt 

*li(s  +  1  —  t)  —  t 


s(l  —  x)  1  (s  +  1  —  x)  2 


whence 


£4(x)exp 


dt 

i /Ms  +  1  —  t)  —  t 


(7) 


J*  s(l  -  u)  *(s  +  1  —  w)-2exp  |j*u 


dt 


0  (S  +  1  -  t)  ~  t 


du  +  C j 


Setting  x  —  z  it  is  seen  that  the  constant  of  integration  CS!  =  Us(z).  Let  now  x  -*  a (s).  Then 
i/Ms  +  1  -  x)  -  x  ~  (a(s)  -  x) ( 1  +  i/i'(s  +  1  -  a(s)),  so  that  the  left  side  of  Eq.  (7)  tends 
to  0.  Hence  the  first  term  of  the  right  side  is  —  t/s(z)  for  x  —  a(s).  Put  Q(s)  =  U5(<o)/s. 
Then 


(8) 


(Ms)  “  f  (1  -  «)"'(s  +  1  -  u)  2  exp 

Jal  jl 


dt _ 

|//(S  +  1  -  t)  ~  t 


The  exp  factor  is  <  1.  A  comparison  with  Eq.  (23)  in  Takacs  [6]  reveals  the  relative  simplicity 
of  Eq.  (8).  From  Eq.  (2)  we  now  get  an  expression  for  G(s).  Making  substitutions  of  variable 
to  get  real  intervals  of  integration  also  for  complex  s  we  obtain,  letting 


(9) 


1  —  a(s)  —  (<u  —  a  (s ) )>». 

(w  -  a (s))  f0/,(y)~'(s  +  /vfy))-2 


(ru  -  a(s))dt 

i Ms  +/«(*))  +AU)- 1 


dy. 


The  resulting  expression  for  G(s )  holds  also  for  complex  s  with  Re(s )  >  0,  and  we  can  use 
Levy’s  inversion  formula  for  characteristic  functions,  which  for  distribution  functions  F  such 
that  F(t )  —  0  for  t  <  0  can  be  written 


(10)  Fit)  -  —  f°°  sin(r x)x_l  R e(FUx))dx, 

IT  •'O 

if  t  >  0  is  a  point  of  continuity  for  F.  The  integral  is  defined  at  least  in  the  improper  Riemann 
sense.  Inverting  G(s)  we  note  that  (fi  -  fito)/(s  +  p  —  /3u>)  is  th?  Laplace  transform  of  the 
exponential  distribution  with  mean  \/(J3  —  /3 ui).  This  is  the  distribui  m  of  waiting  time  (condi¬ 
tional  on  its  being  positive)  in  the  "first  come,  first  served"  G/M/m  queue.  See  Eq.  (14)  in 
Takacs  [6] .  We  can  now  state 


THEOREM  1:  With  Q  given  by  Eq.  (8)  or  (9)  it  holds  that 

G(s)  -  03  -  Pto)/(s  +  0  -  fito)  +  (1  -  to)(s/p)Q(s/i 3) 
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and 

Git)  -  1  -  -  ^-(1  —  to)  JQ  Im (Q(ix))  sin  (fitx)dx . 

From  Eq.  (7)  we  get 

(11)  Miix)  - -  (1  -  x)~2  -  (1  -  u)-3 

™  ( r 

This  relation  can  be  used  for  calculating  mean  wait  when  the  arriving  customer’s  queue  length 
distribution  is  not  the  stationary  one.  Applying  a  Tauberian  result  we  can  obtain 
<J'(0)  -  lims(r(s)  from  Theorem  1.  By  dominated  convergence  in  Eq.  (8)  we  obtain 

S—oo 

(12)  C7'(0)  -  03  -  jSaOw-Wl/d  -  to)). 


3.  THE  MOMENTS 

Takacs  [6]  indicates  by  his  Eqs.  (32)- (36)  a  method  of  calculating  the  moments  E(W"). 
We  shall  here  develop  a  simple  and  explicit  recursive  algorithm  for  this  purpose.  It  is  easily 
shown  that 

(13)  M„ix)  -  (-1)"  U(x)  . 

“S  [  js-0 


Let  us  differentiate  both  sides  of  Eq.  (6)  n  times  in  s  and  r  times  in  x,  putting  s  —  0  and  x  «*  to. 
Simplifying  the  resulting  equation  by  substituting 

Co  =  0, 

cr  -  (1  -  w)r~l  (-D'^’O  -  to)/r\  for  r  >  1, 

Bnr=  (1  -  to)n+r+lMnU)ito)/(n\r\), 

we  obtain  the  following  formula,  which  might  be  considered  the  most  useful  result  of  this  note: 


B„  r  “  (1  +  r-rcx) 


-i 


n  +  r  +  1 
r 


-  rc\Bn.r  +  Z 
/-l 


r 

I 

k-0 


(14) 


n  —  i  +  r 
n  —  i 


r-k 


ik  +  \)Bik+\ 


i  +  k  +  1 

k  1)’ 


n  >  1,  r  >  0. 


The  terms  with  B„r  on  the  right  side  cancel  out,  and  the  term  with  Bnr+X  vanishes,  since 
c0  -  0.  Hence  Eq.  (14)  is  a  recursion.  In  programming,  no  regard  need  be  given  to  the  term 

provided  all  data  registers  for  the  B.s  are  zero  initially.  To  get  B,  0.  2?2.o . Bp,0  we 

calculate  Eq.  (14)  for  r  =  0,  ....  p  —  n  and  n  —  1 . p.  We  get  successively 
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Takacs  [6]  gave  the  first  and  the  second  moment.  For  w  >  4  the  closed  expressions  for  D„  0 
are  complicated,  and  the  recursion  (14)  is  preferable  for  obtaining  numerical  results. 


Let  us  apply  the  results  to  the  cases^  of  constant  and  Erlang-distributed  interarrival  times. 
For  the  deterministic  case,  where  F(s)  -  e~iT  and  p  -  1/0  7",  we  define  co  by  co  — 
exp{(co  -  l)/p),  0  <  co  <  1.  It  holds  that 


(17)  cr  -  co(l-a*)-'(-log(co))7r!,  r  >  1. 

For  the  gamma  (Erlang)  case,  where  F(s)  -  (X/(s  +  k))K  (K  >  0)  and  p 
defined  by  co  -  (\+(\-u)/pK)~K,  0  <  co  <  1.  Here 


(18) 


C,  -  co(l-co)-,(l-col/^)r 


K-\+r\ 
r 


r  >  1. 


X.//3K,  co  is 


In  particular  for  the  M/M/m  queue  ( K  —  1)  we  have  co  —  p  =  \/p  and 
cr  —  (1—  co),_1co(r  >  1).  It  follows  that  for  this  case 

(19)  p3-  1203-X)-3(2  +  co)(2-co)-2. 

Table  1  gives  Bt0,  B2  o.  •••  ,  fiio.o  for  Fls)  equal  to  e~sT,  (X/(s  +  X))4,  and  X/(s  +  X),  i.e., 
for  the  queues  D/ M/m,  EJM/m,  and  M/M/m ,  and  for  the  traffic  intensities  p  =  0.5,  0.7,  and 
0.9.  We  used  the  calculator  TI  59  and  run  time  was  3.75  h  for  each  case,  in  all  33.75  h. 


TABLE  1  —  Values  of  Bn  0  for  1  <  n  <  10 


1 - - — ■ 

D/M/m 

EJ  Ml  m 

MIMIm  | 

»  \ 

0.5 

0.7 

09 

0.5 

~ ! 

0.7 

09 

0.5 

0.7  ] 

0.9 

i- : 

203188  ' 

466996 

806900 

301931 

552912 

.843335 

.5 

.7 

9 

l 1 

1.0 

1,0 

1.0 

10 

1.0 

1.0 

1.0 

1.0 

1.0 

! 2 

1.25500 

1.50053 

1  81251 

1.28835 

1.51643 

1  81482 

1  33333 

1.53846 

1.81818 

i  3 

1  97820 

3.06019 

4.76979 

2.07762 

31 1494 

4.77927 

2.22222 

3.19527 

4.79339 

4 

3.68889 

7.69510 

16.2385 

3  95696 

7.89274 

16  2857 

4.37037 

81 8958 

16  3561 

1  5 

7,75822 

22.5330 

67.1263 

8,51405 

23.3153 

67.3971 

9.72840 

24.5066 

67.8014 

1  6 

17.8677 

74.1797 

323.777 

20  1019 

77.5100 

325  506 

23  8214 

82.6458 

328.092 

i  1 

44.2026 

268  155 

1773.23 

51.0705 

283.195 

1785.31 

62  9102 

306.694 

1803  40 

1  8 

115.889 

1046.83 

10810  9 

137  706 

1118.21 

10902.1 

176.680 

1231.30 

11038  9 

1  9 

318.837 

4359.09 

72282.5 

390. 1 25 

4712.69 

73019.7 

522.226 

5281.22 

741  27  5 

Li®_ 

913  672 

19180.3 

523851 

1152.41 

20999  0 

530185 

161 1  84 

23968.8 

539726 

The  table  illustrates  the  heavy  tail  of  G  in  comparison  with  that  of  the  exponential  distribution 
with  mean  \/(/3  —  |8co),  particularly  under  heavy  traffic. 

Holding  F  fixed  up  to  a  scale  factor  and  letting  p  — •  1  we  have  lim  C|  =  1  and  lim  c,  «•  0 
for  r  ^  1.  This  gives  in  Eq.  (14) 

(20)  lim  B„r  -  (n  +  r)\/r\. 
o-i 

so  that  by  Theorem  2 

(21)  lim  (/3  -  0 a>)>„  =  (n!):. 

0  —  I 

Now  in'.)2  -  E((XY)"),  where  .V  and  >  are  independent  with  density  e  \  We  cannot,  how¬ 
ever,  deduce  that  lim  G(r/(/3  -  pw))  =  PIXY  ^  /)  =  §  e  '(1  -e~"')dy  since  the  moment 


i.  vnm 


m 
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sequence  (n ! ) 2  does  not  determine  the  corresponding  distribution  uniquely.  Neither  does  it 
seem  possible  to  establish  such  a  convergence  from  the  Laplace  transform  G. 

Even  for  moderately  heavy  traffic  Table  1  reveals  that  we  can  write  approximately 
B, ,  o  =«  (n!)  “  for  some  a(0  <  a  <  1)  determined  by  the  parameters,  so  that 

(22)  8o>)-n(n\)'+a. 

We  can  determine  a  to  make  Eq.  (22)  exact  for  n  -  3,  i.e., 

(23)  a  -  log(i?3  0)/log(6), 
where,  by  Eq.  (16)  and  Theorem  3, 

Sj.o  “  M6  -  ci)  +  12  -  8c,  +  3c,2  -  c,3](2  -  c,)~2(3  -  2c,)-'. 

The  approximation  is  not  so  good  for  light  traffic. 

It  is  further  seen  from  Table  1  that  0  depends  heavily  on  the  traffic  intensity  p  but  not 
much,  given  p,  on  the  form  of  the  interarrival  distribution  F,  although  p.„  depends  strongly  on 
F  through  to  in  the  factor  (fi  -  /3to)~".  This  suggests  that  G  can  be  approximated  by  the  distri¬ 
bution  for  M/M/m  after  a  change  of  scale.  More  precisely,  let  GK denote  G  when 
F(t)  =>  1  —  e~x' ;  then  our  results  would  indicate  the  approximation 

(24)  G(t)  =  Gp  ,(/Q3  -  /3to)/(l  -  p)) 

for  p  not  too  small.  The  indeterminancy  of  the  moment  problem  still  remains,  though. 

4.  THE  DISTRIBUTION  FUNCTION  G  FOR  M/M/m  AND  D/M/m 


For  the  queues  M/M/m  and  D/M/m ,  special  formulas  give  more  useful  results  for  the 
calculation  of  G  than  the  general  inversion  formula  of  Theorem  1.  For  M/M/m  the  result  of 
Pollaczek  (4]  seems  to  be  the  most  convenient.  It  can  be  written 


Gp.,(f)  -  1  -  2(1  -p) 


(25) 


r; 

Jo 


exp 


x  +  2  arctan 


Vp  sin 


1  —  Vp  cos  x 


cot  x  —  t(l  +  p  —  2 Vp  cos  x) 


sin  x  dx 


(1  +  p  -  2Vf)  cos  x)2(l  +  e”  C01  *) 


For  D/M/m  the  most  convenient  algorithm  seems  to  be  the  so-called  additional  conditioning 
variable  method  due  to  P.  J.  Burke,  described  in  Cooper  [2],  pp  229-230.  In  this  case  the  con¬ 
ditioning  variable,  the  number  of  arriving  customers  in  (0,f),  is  deterministic.  The  algorithm  is 
a  recursive  scheme,  which  for  D/M/m  can  be  reformulated  in  the  following  way.  Let 


//,*(/)  «  P(pW,  >  k/p  +  t\m  +  j  customers  before  ith  arrival), 

(26) 

k  -  0,  1,  ...  .  0  <  /  <  p-'. 

Then 

(27)  GO)  -  1  —  (1  —  a>)  £  to'  HjAi3pl](pt  -  \&pt)/p),  t  >  0, 

i-0 

and  Hj  k  is  determined  by  the  recursion 


214 


S.  i.  ROSENLUND 


(28) 


J+ 1  .  ,j+\~r 

"'°W  -  ,?  7TT  oTTTTv 


e  ',j>0 


»"«'>  ’  |  TTT'tj?!-"/).  '*«««.  *  >  W  * 


Forming  the  power  series 


Hk,,(x)  -  £  xJHjk(t),  0  <  x  <  1, 

J-0 

we  have 

(29)  GO)  -  1-(1  -  to _  ^pd/pU). 

From  Eqs.  (28)  we  obtain  the  recursion 

H0l(x)  -  x-'  f*  e,(“-1>(l -u)~2du 
Hk,{x)  -  x-‘  JQ  e(u~l)/p //*_].,  (u)t/u,  ^  1. 

This  results  in 

(31)  GO)  -  1  -  (1  -  (o)to-1  f'  e-^uu-2du, 0  t  <  T. 

The  formula  will  hold  for  any  arrival  distribution  Fsuch  that  F(T  —  0)  —  0. 

For  numerical  computations  it  appears  that  Eq.  (31),  when  applicable,  is  better  than  Eqs. 
(27)  and  (28),  while  for  t  >  Tffit  >  p~[)  generally  Eqs.  (27)  and  (28)  are  better  than  Eqs. 
(29)  and  (30).  In  Table  2  we  study  the  suggested  approximation  (24).  For  each  p-value,  the 


TABLE  2  -  Values  ofG  for 
D/M/m  with  approximation  (24) 


P  = 

0.5 

11 

0.8 

pt 

GO) 

appr. 

GU) 

appr. 

0 

0 

0 

0 

0 

0.25 

0.1995 

0.2315 

0.1353 

0.1580 

0.50 

0.3591 

0.3962 

0.2510 

0.2744 

0.75 

0.4868 

0.5169 

0.3501 

0.3646 

1.0 

0.5889 

0.6077 

0.4352 

0.4369 

1.25 

0.6707 

0.6775 

0.5084 

0.4964 

1.50 

0.7371 

0.7322 

0.5483 

0.5464 

1.75 

0.7885 

0.7756 

0.5872 

0.5889 

2.0 

0.8305 

0.8106 

0.6244 

0.6257 

2.5 

0.8683 

0.8624 

0.6921 

0.6858 

3.0 

0.9013 

0.8981 

0.7331 

0.7328 

4.0 

0.9482 

0.9413 

0.8028 

0.8011 

5.0 

0.9670 

0.9647 

0.8503 

0.8477 

7.0 

0.9872 

0.9859 

0.9060 

0.9054 

10.0 

0.9965 

0.9958 

0.9499 

0.9491 

17.0 

0.9997 

0.9996 

0.9848 

0.9846 
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left  column  gives  G(t)  for  D/M/m  while  the  right  column  gives  Gp  ,03/(1  -  <u)/(l  -  p)).  At 
least  for  the  larger  values  of  the  argument  the  agreement  is  seen  to  be  good  for  both  traffic 
intensities.  Since  D/M/m  might  be  denoted  E^/M/m  and  since  the  agreement  in  moments 
was  shown  to  be  better  between  EjM/m  and  M/M/m  than  between  D/M/m  and  M/M/m ,  the 
approximation  (24)  should  be  still  better  for  EK/M/m. 
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ABSTRACT 

This  paper  analyses  the  E£/M/c  queueing  system  and  shows  how  to  calcu¬ 
late  the  expected  number  in  the  system,  both  at  a  random  epoch  and  immedi¬ 
ately  preceding  an  arrival.  These  expectations  are  expressed  in  terms  of  certain 
initial  probabilities  which  are  determined  by  linear  equations.  The  advantages 
and  disadvantages  of  this  method  are  also  discussed. 


1.  INTRODUCTION  AND  BACKGROUND 

This  paper  investigates  a  oserver  queue  with  exponential  service  times  in  which  arrivals 
occur  in  batches  or  groups  of  size  X ,  where  X  is  a  random  variable.  The  time  between  the 
arrivals  of  two  groups  is  Erlang-distributed. 

The  distribution  of  X ,  the  size  of  the  arriving  groups,  is  denoted  by  {a,},  where 

oo 

a,  =  P(X  =  /') .  The  expected  batch  size  is  denoted  by  a ,  which  is  equivalent  to  £  iar 

/-I 

The  arrival  rate  of  the  groups  is  denoted  by  X .  Since  the  time  between  the  two  arrivals  is 
Erlang-k  distributed,  the  phases  change  at  a  rate  of  k\.  Units  entering  the  system  join  a  single 
queue  of  unlimited  size.  The  service  facility  is  a  group  of  c  identical,  exponential  channels  in 
parallel,  each  with  a  service  rate  gt.  The  transition  into  service  is  instantaneous  in  that,  when  a 
service  channel  is  free,  a  unit  from  the  queue  (if  the  queue  is  not  empty)  goes  into  service 
without  delay. 

Multiserver  queues  with  batch  arrivals  have  been  investigated  by  a  number  of  authors.  In 
particular,  Neuts  [8]  has  recently  shown  that  in  the  Glx/ M/c  queueing  system,  the  queue 
length  distribution  is  matrix  geometric,  provided  the  group  size  cannot  exceed  a  certain  limit  g. 

'The  research  for  this  paper  was  supported  in  part  by  the  Defense  Research  Board  of  Canada.  Grant  No.  3610-603,  and 
by  Natural  Science  and  Engineering  Research  Council  Canada,  Gram  No.  AS)  12. 
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In  a  more  restricted  scope,  Abol’nikov  [1]  and  Kabak  [7]  have  studied  the  Mxt M/c  queue. 
The  results  of  these  authors  were  later  consolidated  and  extended  by  Cromie,  et  al.  [3]. 
Cromie  [3]  also  corrected  the  waiting-time  distributions  of  the  earlier  authors,  following  a  paper 
of  Burke  [2]. 

This  paper  concentrates  on  finding  the  expected  number  of  elements  in  the  system.  How¬ 
ever,  an  extended  version  of  this  paper  is  available  [5,61  which  discusses  not  only  the  distribu¬ 
tion  of  the  number  of  elements  in  the  system  but  also  the  waiting-time  distribution  and  the 
related  measure  of  efficiency.  The  results  concerning  the  mean  waiting  time  W  are  not  being 
presented  here  partly  because  of  space  constraints  and  partly  because  once  L  is  known,  W  can 
be  evaluated  using  the  Little’s  formula,  L  —  kaW. 

2.  THE  EQUILIBRIUM  DISTRIBUTION  OF  QUEUE  LENGTH 

This  section  gives  equations  to  determine  p„„  the  joint  equilibrium  distribution  of  n  and 
r,  the  number  of  elements  in  the  system  with  the  arrival  group  being  in  the  r  th  phase, 
1  <  r  <  k.  It  also  derives  a  generating  function  for  the  number  of  elements  in  the  system, 
and  gives  the  expected  number  of  elements  at  a  random  point  in  time.  Further  resuits  can  be 
found  in  the  extended  version  of  this  paper  [6]. 

Using  the  normal  approach,  the  equilibrium  equations  for  the  system  in  question  can  be 


found  to  be 

0  <  n  <  c 

(1) 

(kk  +  np)pnr  -  (n  +  l)pp„+i.,  +  kkpn±r. ,,  2  ^  r  ^  k, 

(2) 

(kk  +  np.)p„  [  —  ( n  +  1W„+U  +  kk  £  OiPn-i,k>  n  <  c, 

/-I 

n  ^  c 

(3) 

(kk  +  cp)p„,  -  cfipn+]'>  +  kkpnr-\,  2  ^  r  ^  k, 

(4) 

n 

(kk  +  Cfj.)pnA  -  cpipn+ i.i  +  kk  £  a,pn_a,  n  ^  c. 

i- 1 

We  also  define  p„  as  the  number  of  elements  in  the  system,  that  is, 

k 

Pn  £  Pn.r- 

t- 1 

The  sum  of  all  p„  must  equal  one.  This  can  only  be  accomplished  if  p  <  1,  where  p  is  defined 
as  ak/(ctx). 

It  can  be  shown  that  the  probability  that  there  are  n  elements  in  the  system,  given  that 
the  arrival  process  is  in  phase  r,  is  equal  to 

P(n\r)  =  k  p„r. 

The  generating  function  of  P(n\r)  will  turn  out  to  be  important  later.  We  therefore  define 

Pr(z)  -  £  P(n\r)z"  =  k  £  p„,  r". 

.. -o  «-n 
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Also  of  importance  is  A(z),  the  generating  function  of  the  distribution  of  the  arrival 
group  size,  which  is 

A(z)  =  £  z"  a„. 

«-! 


It  is  shown  in  the  extended  version  of  this  paper  that  pk(z)  is  equal  to 


(5) 


If  one  defines 


PAz)  = 


1 

N 

'w' 

■< 

|i  +  it 

kk 

CU  r-l 

Z  ~  -fr  z  '  £  (c  ~  n)p„  ,  z" 

i  +  it 
kX 

z  -  it 
k\ 

k 

—  A  (z)zk 

1  +  ^ 


CP  1 
kk  z  ’ 


Pk  (z)  can  be  transformed  to  give  the  following  expression: 


F*(z) 


iL 

k 

l-i 

Z 

k  c-1 

I  I^1 

r— 1  n  -0 

1  1 

1  5 

|  > 

i  1  +  4*- 

_  ££l1 

k 

-  A  (z) 

k\ 

kk  z 

We  now  define 


V{z)  = 


C/JL  1 

kk  z 


-  A(z), 


U(z)  =  II  y'-'  z"  (c  -  n)  p„r. 

r  — 1  rr  — 0 


It  can  now  be  shown  that  V(z)  has  A:  —  I  zeros  inside  the  unit  circile,  which  we  number 
z | ,  z2,  _ z*_|.  For  these  zeros,  U(z)  must  be  zero,  which  gives  k  -  1  equations  to  deter¬ 

mine  the  p„  r 

(6)  U(z,)  =  £  £  z,"  (c  -  n)  p„,  =  0. 

r-l  rr— 0 


Here, 


y, 


l  +  m 

kk 


_  Cfl  J_ 
kk  z, 


Furthermore,  it  can  be  shown  that 

(7)  £7(1)  -  £  £  (c  -  n)  pnr  -  c (1  -  p ). 

r—  1  rr— 0 


Equations  (1),  (2),  (6),  and  (7)  are  sufficient  to  determine  pnr,  n  <  c,  r  —  l,  2.  ...  k. 
Once  the  p„,  are  determined,  other  measures  of  performance  are  obtained  easily.  For  instance, 
the  expected  number  in  the  system  immediately  before  an  arrival  epoch  can  be  shown  to  be 
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(8)  L 

where 


i  r-l  k 


«+(r-  1)-^ 
k\ 


P,,  +  cp-±(k-l)j£  +  J^"(l)J. 


A" (1)  - 


d2A  (z) 


dz 


:-l 


For  the  expected  number  in  the  system  at  a  random  epoch,  one  finds 


(9) 


L  -  — 

C/x 


a  L~  +  —  ,4 "(1)  +  a 


+  —  £  (c  -  n)  n  p„. 


Once  the  p„.r  are  known  for  n  <  c,  the  p„  r  can  also  be  calculated  for  n  >  c  as  shown  in 
[6],  Furthermore,  it  is  possible  to  obtain  waiting-time  distributions  and  the  related  measures  of 
efficiency. 

As  an  application  of  the  theory,  consider  the  E*/M/c  queueing  system  in  which  the  group 
size  follows  a  geometric  distribution.  Then 

a,  =  p(  1  —  p)'"1,  i  >  0,  0  <  p  <  1, 


- - (1  -  p) 

z 


1  +  ^ 

CM  V 

2 

—  p 

2A 

2A  z 

-  ~  (1  ~p) 
z 


-1 


A  ( z )  -  pz/[  1  -  (1  -  p)z  1  =  p 
V(z)  now  becomes 
y(z) 

If  —  is  replaced  by  x,  this  results  in  the  following  equation  for  x : 

12 

[x  -  1  +  p)  -  p  «  0. 

One  divides  this  equation  by  (x  -  1)  and  obtains  a  quadratic  equation  with  the  roots  v,  and  x2- 
One  of  the  roots,  say  xx,  is  greater  than  one,  and  this  root  gives  Z]  =  1/x,.  One  now  calculates 


l  + 

~^-x 

2A 

2A 

,  +£fL 

2A 


CM 

2A 


and  gets  from  Eq.  (6)  in  an  explicit  form 

f/(zi)  =  (c  -  0)  p0  |  +  (c  -  0)  p„  2  J'i  + 

(c  -  1)  p,  ,  z,  +  (c  -  1>  Pi  :  r,  z,  + 

. + 

Pi  -i.i  *i  1  +  Pi  -i  :  Ai  - i  1  ■ 

A  similar  equation  is  obtained  from  Eq.  (7): 

f/(l)  =  c  (1  —  p)  =  (c  —  0)  Po  i  +  (c  -  0)  p„  j  + 
(c  —  1 )  pi  |  +  (e  —  1 )  P|  :  + 
. + 

Pi-  t  I  +  Pt  -1.2- 
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Equations  (1)  and  (2)  give 

2  k  Po. i  “  M  Pi, i 
2  *  Po.2  "  M  Pi. 2  +  2  X  Po.i 


[2x  +  (c  -  2)/ll  Pc— 2. 1  -  (c  -  l)/x  Pc- 1,1  +  2\[o,  p,_3.2  +  •  •  •  +  0L- 2  P 0. 21 

[2x  +  (c  -  2)m)  Pc— 2.  2  =•  (<-'  -  1)/X  Pc-\.2  +  2X  pr_2.i. 

This  gives  2c  equations  for  the  2c  variables  p0.i«  P0.2.  Pc-1.1.  Pf-i,2«  and  we  found  that  these 

equations  can  be  solved  easily  for  c  up  to  10.  Once  the  p„,,  0  <  n  <  c,  1  <  r  <  X,  are 

known,  I-  and  £  are  given  by  Eq.  (8)  and  (9). 

3.  THE  POWER  AND  LIMITATIONS  OF  THE  METHOD 


If  k  is  low,  the  zeros  of  F(z)  inside  the  unit  circle  can  be  found  easily.  In  particular,  if 
k  *  2  and  the  group  size  distribution  is  geometric,  no  problem  arises,  as  was  shown  in  the 
preceding  section.  But  even  if  Viz)  is  a  polynomial  of  high  degree  or  transcendental,  the  zero 
of  V(z)  inside  the  unit  circle  can  be  found  readily  as  long  as  k  =  2.  For  instance,  we  had  no 
problem  finding  zt  for  the  E* / M / c  queue  with  group  sizes  that  are  Poisson  distributed.  In  gen¬ 
eral,  the  root  z,  can  be  found  as  follows.  One  first  shows  that  z,  is  in  the  range 

(10)  a  <  Z)  <  b, 


where 


a 


]+SJL 

~\ 

1+^ 

kk 

2k 

b  = 

1+2  4^ 

1+^- 

kk 

X 

Equation  (10)  is  correct  because  V(a)  <  0  and  V(b)  >  0.  Since  there  is  a  narrow  range  for 
z,,  Newton’s  method  or  the  rule  of  the  false  position  can  be  applied  to  find  Zj. 


For  higher  values  of  k ,  one  has  to  use  complex  arithmetic.  In  order  to  see  this,  the  fol¬ 
lowing  theorem  is  useful. 


THEOREM:  If  k  is  odd,  all  k  —  1  zeros  of  V{z)  inside  the  unit  circle  are  complex.  If  k 
is  even,  F(z)  has  k  -  1  complex  zeros  inside  the  unit  circle,  and  one  real  zero.  Moreover,  the 
real  zero  satisfies  Eq.  (10). 


Thus,  if  k  =  3,  one  has  to  find  two  complex  zeros  inside  the  unit  circle,  and  these  zeros 
are  conjugate.  If  k  =  4,  one  has  a  pair  of  conjugate  complex  zeros  and  one  real  zero,  and  so 
on.  To  find  a  pair  of  conjugate  complex  zeros  is  feasible.  Hence,  k  =  3  and  k  =  4  will  not 
pose  unsurmountable  problems  even  if  the  group  size  has  a  transcendental  generating  function, 
as  is  the  case  if  the  group  size  in  Poisson.  For  higher  X,  the  zeros  inside  the  unit  circle  can  be 
found  conveniently,  provided  A  (z)  is  a  real  function.  The  equation  F(z)  =  0  can  then  be  con¬ 
verted  to  a  polynomial,  and  there  are  efficient  computer  algorithms  available  to  solve  such  poly¬ 
nomials.  Indeed,  it  has  been  shown  in  another  connection  that  it  takes  only  a  few  seconds  on  a 
computer  to  find  the  roots  of  a  polynomial  of  degree  50  [4], 
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For  high  values  of  c  and  k,  the  solution  of  ck  linear  equations  that  determine  the  ck  vari¬ 
ables  p„„  0  <  n  <  c  —  1,  1  <  r  <  k,  may  pose  problems.  In  this  case  one  has  to  use  algo¬ 
rithms  that  make  use  of  the  block-diagonal  structures  of  the  system.  Further  problems  arise  in 
the  case  of  double  roots.  These  problems  are  discussed  in  the  extended  version  of  this  paper. 
They  can  occur  only  for  k  >  5  because  of  the  theorem  stated  above.  Moreover,  it  can  be 
shown  that  no  double  roots  will  occur  if  the  group  size  is  geometric  or  constant. 
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ABSTRACT 

Single  server  queues  with  general  interarrival  and  service  times  are  approxi¬ 
mated  by  queues  with  two-point  (Bernoulli)  interarrival  times  and  exponential 
service  times.  The  parameters  are  chosen  such  that  the  first  four  moments  of 
the  difference  of  the  service  times  and  interarrival  times  in  the  approximating 
system  equal  those  of  the  original  system.  The  aptness  of  the  approximation  is 
discussed  and  some  examples  are  presented  comparing  the  exact  and  approxi¬ 
mate  waiting  time  distributions.  A  more  complicated  approximation  is  pre¬ 
sented  using  the  dual  system  (exponential  arrivals,  Bernoulli  service)  for  those 
cases  where  the  original  approximation  cannot  be  used. 

A.  INTRODUCTION 

The  use  of  the  Erlang  family  of  distributions  to  approximate  interarrival  and  service  time 
distributions  in  single  server  queues  has  been  studied  by  a  number  of  authors.  Kotiah,  Thomp¬ 
son  and  Waugh  [5]  presented  an  algorithm  to  solve  for  the  l  roots  of  the  functional  equation 
arising  in  EJ EJ 1  queues.  The  method  depends  on  /  being  integer  and  the  solution  involves 
pairs  of  complex  conjugate  roots  which  are  used  in  the  expression  for  the  waiting  time  distribu¬ 
tion.  Though  the  algorithm  is  amenable  to  hand  calculation,  the  amount  of  work  required  to 
end  up  with  a  usable  equation  involving  only  real  numbers  is  substantial.  More  general 
methods  for  noninteger  /  have  been  suggested  (see  for  example,  Wishart  [9]),  although  no  sim¬ 
ple  manual  algorithms  are  given.  Marchal  and  Harris  [6]  pointed  out  that  the  approximation 
technique  can  be  improved  if,  instead  of  fitting  distributions  separately  to  the  service  time  S 
and  the  interarrival  time  T,  a  distribution  is  fitted  to  the  difference  U  =  S  -  _T,  since  the  wait¬ 
ing  time  distribution  depends  only  on  U  and  not  on  S  and  T  individually.  Their  scheme 
involved  finding  two  Erlang  distributions  such  that  the  first  four  moments  of  the  difference  of 
these  random  variables  match  the  first  four  moments  of  the  actual  difference.  Unfortunately, 
the  solution  algorithm  requires  machine  calculation. 

A  simple  approach  in  the  spirit  of  Marchal  and  Harris  is  to  assume  exponential  service 
times  and  utilize  a  Bernoulli  distribution  (B)  of  interarrival  times.  With  the  Bernoulli  distribu¬ 
tion,  the  interarrival  times  take  on  the  value  t\  with  probability  1  -  p  and  the  value  / 2  >  /) 
with  probability  p.  The  idea  is  to  approximate  a  GI/G/1  queue  by  a  B/M/l  queue,  where  the 
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parameters  p,  and  r2,  plus  the  parameter  m  of  the  exponential  distribution,  are  chosen  such 
that  the  first  four  moments  of  V  in  the  approximate  system  match  the  corresponding  moments 
of  the  original  system.  The  waiting  time  distribution  will  be  exponential  and  the  computations 
are  particularly  simple.  However,  as  will  be  seen  in  the  third  example  further  on,  this  approxi¬ 
mation  is  not  always  possible.  The  dual  system,  the  M/B/l  queue,  might  be  used  to  obtain  the 
waiting  time  approximation  in  that  case. 

The  fact  that  the  waiting  time  distribution  can  be  approximated  by  an  exponential,  at  least 
asymptotically,  has  been  known  for  some  time.  Kingman  [41  derived  an  exponential  approxi¬ 
mation  to  the  waiting  time  distribution  for  GI/G/1  queues  in  heavy  traffic,  and  Riordan  [7) 
suggests  the  exponential  approximation  to  waiting  times  in  M/G/l  queues.  As  Wishart  [9] 
points  out,  almost  any  probability  frequency  function  can  be  written  as  a  weighted,  infinite  sum 
of  integer  Erlang  distributions.  A  result  of  Smith  [8]  can  be  applied  to  show  that  the  waiting 
time  distribution  can  then  be  written  as  an  infinite  sum  of  negative  exponentials.  For  large 
values  of  the  random  variables,  the  terms  with  large  (in  magnitude)  exponents  vanish  rapidly 
and  the  term  with  the  smallest  parameter  governs  the  tail  of  the  distribution. 


2.  THE  B/M/l  APPROXIMATION 

Let  a,  be  the  /  th  moment  of  the  interarrival  times,  and  let  b,  be  the  /  th  moment  of  the 
service  times.  These  moments  could  be  either  obtained  from  knowledge  of  the  distribution  or 
calculated  from  data.  The  first  four  moments  of  U  are: 

c,  =  ft,  -  a, 

c2~  b2~  2b, a,  +  a2 

c3  “  63  ~  3/>2a,  +  lb,a2  -  a3 

c4  =  f>4  -  4 b}a,  +  6b2a2  -  46, a3  +  a4. 


If  the  B/M/l  queue  is  used  as  an  approximation,  then  the  /'th  moment  of  the  approximat¬ 
ing  interarrival  distribution  is 

a,'  -  (1  -  p)t\  +  pt'2, 

and  the  1  th  moment  of  the  approximating  service  distribution  is 
b '  — 

The  moments  of  the  difference,  c,’  are  obtained  as  are  the  c,  with  a,’  and  b replacing  a,  and  b , 
respectively.  Setting  c'  —  c,  for  /  =  1,  2,  3,  4  yields 

- - [(1  -  p)i,  +  pt 2j  =  c, 

m 

-^7  —  —  1(1  —  p)t  I  +  pt  ^  Id  —  p)t\  +  pt]  1  =  C  2 
m 

-Ar - K  ((1  -  p)t\  +  pt2]  +  -{(!-  p)i?  +  pf]]  -  [(1  -  p)r]  +  pi]}  =  r, 

mi  m‘  m 

-^7  -  ^7 1(1  -  p)t  1  +  pt2 1  +  Id  -  p)t]  +  pt]  1  -  —  Id  -  p)t ]  +  p< 2 1 

m  m  m  m 

+  [G  —  p)t]  +  pt2]  =  c4. 


WAITING  time  distribution  in  single  server  queues 


225 


These  four  equations  must  be  solved  simultaneously  for  m,  p,  tu  and  t2.  The  solution  is 
tedious  but  the  final  result  is  surprisingly  simple.  The  parameter  m  is  the  solution  to  the  cubic 
equation 

(1)  mi  +  Kxm  +  K2  —  0, 


with 


and 


_ 4 CjC3  -  c4  -  3cj _ 

2C!C2C3  +  c2c4  -  Ci2C4  -  cl  -  cl  ’ 


*2 


Cj  -  3 ctc2  +  2c{ 

2C!C2C3  +  C2C4  -  C\C4  -  cl  - 


c 


3  • 
2 


The  extraction  of  the  roots  of  a  cubic  equation  is  discussed  in  most  college  algebra  texts  and 
mathematical  handbooks,  for  example,  Burington  [1], 


The  remaining  parameters  are  calculated  as  follows:  let 

(c2c3  -  c2)m 4  +  (ctc2  -  c3)m3  +  (3 c2  -  4c2)m2 


= 


(c  2  —  c?)m2  —  1 


and 


K  4  = 


+  c2m2  —  2  cxm 


1  -  c,m 

The  larger  root  of  the  quadratic  equation 
>2-  K4y  +  K}  =  0 

is  denoted  by  y2  and  the  smaller  root  by  y\.  Then 
1  -  y\  ~  c ,m 


(2) 


P  = 


y2~  y\ 


Ll 

m 


h  = 


£ l 

m 


The  cubic  equation  in  m  may  have  more  than  one  real  positive  solution.  The  experience 
of  the  author  in  working  examples  is  that  when  this  occurs  (and  it  will  occur  if  the  third 
moment  of  the  service  time  distribution  about  its  mean  exceeds  the  third  moment  of  the 
interarrival  distribution  about  its  mean)  none  of  the  real  positive  solutions  will  yield  positive 
values  for  p ,  t j,  and  t2.  The  B/M/l  queue  cannot  be  used  as  an  approximation  in  that  case. 

If  m,  p >,  and  t2  are  obtained  as  above,  then  the  equation 

(3)  (1  -  p)e  1  +  pe  2  -  z  =  0 

has  one  real  root  in  the  interval,  0  <  z  <  1;  call  this  root  Z.  The  expected  waiting  time  prior 
to  service  in  this  approximate  system  is 

7 

W'  «  - - - 

q  (1  —  Z)m 

The  probability  that  an  arrival  does  not  have  to  wait  is 
I^;(0)  -  1  -  Z, 
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and  the  probability  of  a  wait  longer  than  t  prior  to  the  start  of  service  is 
1  -  W'q(t)  -  Ze~m(l~z)l 
(see  Gross  and  Harris  [3],  pages  277-279). 

3.  SOME  EXAMPLES 

EXAMPLE  1 :  For  the  EJ EJ\  queue  with  k  —  1.5,  1/A  —  2.5,  /  -  6,  l//u  —  2,  the  algo¬ 
rithm  of  Kotiah  et  al.  [5]  yields  the  exact  probability  of  a  wait  greater  than  / : 

1  -  Wq(t)  =  0.802<r°  252'-  0.013e-4  740' +  0.050e~2  271' 

[cos(  1.234  +  1.805/)  -  0.649  cos(0.047  +  1.805/)] 

+  0.02  le-4  008'  [cos(2.193  +  1.470/)  -  0.594  cos(0.021  +  1.470/)]. 

The  approximating  system  is  based  on  the  interarrival  moments 


a,  =  —  n  (1-5+7) 
lD  /- o 


and  service  time  moments 

b,  =  \  ri  («+/)• 

6  i-0 

These  are  used  to  calculate  the  c,  and  A,  and  K2 ■  The  cubic  equation  becomes 
ml  -  0.5486m  -  0.1405  “  0 

which  has  m  =  0.8454  as  its  single  real  positive  solution.  A3  and  A4  can  now  be  calculated  to 
yield  the  quadratic  equation 

y2  -  6.9699y  +  5.4373  =  0. 

The  roots  are  y(  =  0.8951,  y2  =  6.0748.  Thus,  p  —  0.1019,  /]  =  1.0587,  t2  =  7.1854,  and  the 
approximating  distribution  is 

1  -  H^(r)  =  0.71  le"0  244'. 

The  exact  and  approximating  distributions  are  compared  in  Table  1  as  are  the  means  of  the 
interarrival  and  service  time  distributions,  the  mean  waiting  times,  and  the  traffic  intensities. 
These  expectations  are  shown  to  demonstrate  that  in  spite  of  the  fact  that  the  approximating 
system  does  not  resemble  the  exact  system,  the  waiting  time  distributions  are  close  to  each 
other. 

EXAMPLE  2:  For  the  queue  with  hyperexponential  interarrivals  with  probability  density 
function 

a(t)  =  (2/9)e~(2/3)'+  (8/9)e"(4/3)' 
and  hyperexponential  service  with  density  function 
*</)  —  (l/4)<r'+  (9/4)  e-3', 
the  probability  of  a  wait  longer  than  /  is 

1  -  WqU)  -  0.420<r°6"'  +  0.106c~2  327' 

(see  Greenberg  (21).  The  interarrival  moments  are 
a  =  / !  ( 3  1  +  1.5'_l)/2'. 
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TABLE  1  —  Comparison  of  Exact  and 
Approximate  B/M/l  Systems 


Example  1 

Example  2 

Exact 

Approximate 

Exact 

Approximate 

Mean  interarrival  time 

2.5 

1.683 

1 

1.377 

Mean  service  time 

2 

1.183 

0.5 

0.877 

Traffic  intensity 

0.8 

0.703 

0.5 

0.636 

Average  delay 

Prob  (delay  >  /) 

t 

0 

3.16 

2.92 

0.73 

0.78 

0.748 

0.711 

0.526 

0.470 

1 

0.619 

0.557 

0.238 

0.257 

2 

0.485 

0.437 

0.125 

0.140 

3 

0.377 

0.342 

0.067 

0.077 

4 

0.293 

0.268 

0.036 

0.042 

5 

0.227 

0.210 

0.020 

0.023 

6 

0.177 

0.165 

0.011 

0.013 

7 

0.137 

0.129 

0.006 

0.007 

8 

0.107 

0.101 

0.003 

0.004 

9 

0.083 

0.079 

0.002 

0.002 

10 

0.065 

0.062 

0.001 

0.001 

and  the  service  time  moments  are 

b,  =  ,!(4'-'  +  (4/3)'-')/4'. 

The  cubic  equation  becomes 

m3  -  1.0335m  -  0.3057  =  0, 

which  has  m  =  1.1408  as  its  single  real  positive  solution.  The  approximating  distribution  is 
1  -  J V'q(t)  =  0.470e-°604', 

and  the  exact  and  approximate  systems  are  compared  in  Table  1. 


EXAMPLE  3:  For  the  queue  with  Erlang  interarrivals  with  k  =  3  and  1/A 
hyperexponential  services  with  density  function 

bit)  -  0.04c-°4'+  3.24e-36', 

the  probability  of  a  wait  longer  than  t  is 

1  _  wq(t)  -  0.587e-°168'  +  0.079c-2  863' 

(see  Greenberg  [2]).  The  moments  are 


a, 


fl  (3  +  j) 

j-0 


and 


b,  -  f!(10'-*  +  (10/9) '— 1)/4'. 


The  cubic  equation  becomes 


2/3,  and 


m3  -  2.0457m  +  0.7495  =  0. 
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which  has  two  real  positive  solutions:  m  -  1.899  or  0.3969.  Using  the  first,  A 3  -  — 15.11  and 
using  the  second  A3  —  -0.11.  Thus  the  quadratic  equation  in  y  will  have  a  single  positive  root 
and  the  appropriate  solutions  for  p ,  and  t2  do  not  exist.  In  this  case,  no  B/M/l  queue  can 
be  used  as  an  approximation. 


4.  THE  M/B/l  APPROXIMATION 


If  the  M/B/l  queue  is  used  as  an  approximation  with  L  as  the  parameter  of  the  exponen¬ 
tial  interarrival  time  distribution  and  service  times  u,  (with  probability  1  -  q)  and  u2  >  u , 
(with  probability  q ),  then  the  /  th  moment  of  the  approximating  interarrival  distribution  is 
a',  —  i\/L‘  and  the  /  th  moment  of  the  approximating  service  time  distribution  is 
b '  —  (1  —  q)u\  +  qu2.  The  ensuing  cubic  equation  which  must  be  solved  for  L  differs  from 
Equation  (1)  only  in  one  sign: 

L}  +  A, I,  -  K2  -  0 

with  K\  and  K2  as  in  Section  2.  Note  that  the  real  positive  roots  of  this  equation  are  the  abso¬ 
lute  values  of  the  real  negative  roots  of  Equation  (1). 


The  remainder  of  the  derivation  is  parallel  to  that  of  the  B/M/l,  resulting  in  a  quadratic 
equation  similar  to  the  one  in  Section  2: 


where 


and 


y2-K6y  +  *5  =  0 

(C)C3  -  c22)L4  -  (c,c2  -  c3)L 3  +  (3c2  -  4c2) L2 


A 


*,- 


(c2  -  c2)L2  -  1 
*5  +  c2L 2  +  2  C\L 


1  +  C\L 

The  two  roots  are  y2  and  ,V|  <  y2.  The  estimate  of  q  is  similar  to  the  estimate  of  p ,  Equation 
(2),  again  differing  only  in  one  sign: 

1  -  y,  +  CjZ. 


Q  = 


y 2  -  y\ 


and 


U|=y,/L,  u2  -  y2! L. 


The  expected  waiting  time  prior  to  service  for  this  approximating  M/B/l  queue  is 


Lb2 

2(1  -  r)  ' 


where  r  is  the  traffic  intensity  of  the  approximating  system  given  by 


/•=/,[(!-  +  qu2\- 


The  Laplace-Stieltjes  transform  of  the  Bernoulli  service  time  distribution  is 
(1  —  t/)exp(—  flt/|)  +  q  exp(— #«,).  This  can  be  substituted  into  the  Pollaczek-Khinchine  for¬ 
mula.  This  latter  expression  can  be  expanded  in  an  infinite  series  and  the  binomial  theorem 
invoked  twice  to  yield  the  transform  of  the  approximating  waiting  time  distribution: 
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exp(-9i)dW'q(t)  = 


(1  ~r)  £/.'  1 I 

i-0  k  -0  .-0 


[-(1  -  q)V  '(-q)' 

(A  -  -  A)! 


iL 


exp(-#[/u;  +  (k  —  / )  m  1 1 J . 


Reversing  the  order  of  summation,  summing  over  _/,  and  multiplying  by  9  yields  the 
transform  of  H^(r),  which  can  be  inverted  term  by  term  and  subtracted  from  unity  to  yield  the 
probability  of  a  wait  longer  than  t  : 


1  -  W'U) 


/( 1 )  / <  2) 

1  ~  (1  -  r>  £  (-1)A  £ 

A-0  /-0 


-  iu 2  —  (A  -  / ) m 1 1 J ' 

_ 


{(1  -  q)  L  [/  -  iu2  -  (A 
(A  -  /)! 


exp(L  [r  -  /u2  —  (A  -  /)«,]) 


where  /(l)  is  the  largest  integer  less  than  or  equal  to  i/u\  and  1(2)  is  the  largest  integer  less 
than  or  equal  to  ( i  -  kui)/(u2  -  u ,). 


This  expression,  although  a  finite  sum,  is  quite  difficult  to  evaluate  numerically  even 
using  double  precision  arithmetic  on  a  computer.  The  approximation  to  the  queue  of  example 
3  requires  a  sum  of  thirty  terms  for  the  probability  of  a  wait  longer  than  ten  units,  a  probability 
of  about  0.1,  with  a  number  of  the  terms  exceeding  109  in  magnitude. 


An  "approximation  to  the  approximation"  can  be  used  to  obtain  a  simpler  result,  using  the 
Riordan  approximation  alluded  to  earlier.  For  the  M/B/l  queue  the  Riordan  approximation 
becomes 


1  -  W'(r)  ~ 


1  -  Z.  [( l  -  q)u]  +  qu2] 


L  [(1  -  </)«,<'  ,'<l  />"'+qu2e 
where  Z  >  1  is  the  largest  root  of  the  analog  of  Equation  (3): 

,,  ,  — rU-:>K.  -/.(I-.-)ii,  n 

(1  -  q)e  '+qe  *— z  =  0. 


-  1 


exp|A(l  -  Z)i] 


EXAMPLE  3  ( continued ) :  The  cubic  equation 
LJ  -  2.0457E  -  0,7495  =  0 

has  one  real  positive  solution,  L  —  1.5868.  Note  that  m  =  -1.5868  is  the  third  root  of  the 
cubic  equation  for  m  in  the  first  part  of  this  example.  K s  and  A  „  can  now  be  calculated  to  yield 
the  quadratic  equation 

y2  -  15.9005.V  +  8.9137  =  0 

with  roots  >’|  =  0.5819,  y2  =  15.3187.  Hence  q  =  0.0104,  n,  =  0.3667,  and  u:  =  9.6538.  The 
approximating  traffic  intensity  is  r  =  0.735. 

The  Pollaczek-Khinchine  sum  for  1  —  H^(r)  can  be  evaluated  at  this  point.  The  Riordan 
approximation  requires  the  largest  root  of 

(1  -  0.0104)e“n  5868,(1  -><o 3667)  +  o 0104c  n  5868111  l",  6538>  _  -  =  q 

which  is  ;  =  1.1073  =  Z.  Hence, 

1  -  W’Ji)  =  0.6057c  °  170V. 
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The  exact  probability  of  having  to  wait  longer  than  time  t  is  contrasted  with  the  M/B/l  approxi¬ 
mation  based  on  the  Pollaczek-Khinchine  formula  and  with  the  Riordan  approximation  in  Table 
2.  The  expected  waiting  time  of  the  approximating  M/B/l  system  is  W'q  -  3.3  compared  to  the 
true  value  for  the  original  system  of  3.5. 


TABLE  2  —  Approximations  for 
Example  3 


Probability  of  Delay  >  t 

t 

Approximations 

Exact 

M/B/l 

Riordan 

0 

0.666 

0.736 

0.606 

0.5 

0.559 

0.483 

0.555 

1 

0.501 

0.389 

0.511 

2 

0.420 

0.339 

0.431 

3 

0.355 

0.312 

0.363 

4 

0.300 

0.285 

0.306 

5 

0.253 

0.257 

0.258 

6 

0.214 

0.227 

0.218 

7 

0.181 

0.197 

0.184 

8 

0.153 

0.166 

0.155 

9 

0.129 

0.133 

0.131 

10 

0.109 

0.107 

0.110 

REFERENCES 

[1]  R.S.  Burington,  Handbook  of  Mathematical  Tables  and  Formulas  (Handbook  Publishers.  Inc., 

Sandusky,  Ohio,  pp.  7-9,  1953). 

[2]  I.  Greenberg,  "Single  Server  Queues  with  Hyperexponential  Service  Times,"  Naval  Research 

Logistics  Quarterly,  24,  pp.  451-455  (1977). 

[3]  D.  Gross  and  CM.  Harris,  Fundamentals  of  Queueing  Theory ,  (Wiley,  New  York,  1974). 

[4]  J.F.C.  Kingman,  "The  Heavy  Traffic  Approximation  in  the  Theory  of  Queues,"  in  Proceed¬ 

ings  of  the  Symposium  on  Congestion  Theory  (University  of  North  Carolina  Press,  Chapel 
Hill,  North  Carolina,  1962). 

[5]  T.C.T.  Kotiah,  J.W.  Thompson  and  W.A.O.  Waugh,  "Use  of  Erlangian  Distributions  for 

Single  Server  Queueing  Systems,"  Journal  of  Applied  Probability,  6,  pp.  584-593  (1969). 

16]  W.G.  Marchal  and  C  M.  Harris,  "A  Modified  Erlang  Approach  to  Approximating  GI/G/1 
Queues,"  Journal  of  Applied  Probability,  13,  pp.  118-126  (1976). 

(7]  J.  Riordan,  Stochastic  Service  Systems  (Wiley,  New  York,  New  York,  1962). 

(8]  W.L.  Smith,  "On  the  Distribution  of  Queueing  Times,"  Cambridge  Philosophical  Society 

Proceedings,  49,  pp.  449-461  (1953). 

(9]  D.M.G.  Wishart,  "A  Queuing  System  with  Service  Time  Distributions  of  Mixed  Chi- 
Squared  Type,"  Operations  Research,  7,  pp.  174-179  (1959). 


CONGESTION  TOLLS:  EQUILIBRIUM 
AND  OPTIMALITY 

Robert  W.  Rosenthal 

Bell  Telephone  Laboratories,  Inc. 

Murray  Hill.  New  Jersey  07974 


ABSTRACT 

An  example  of  a  network  with  flow  costs  depending  on  congestion  is 
presented  for  which  no  system  of  tolls  and  subsidies  exists  which  can  ensure 
that  all  equilibria  in  the  game  of  route  selection  are  Pareto  optimal. 


It  has  long  been  recognized  that  in  transportation  facilities  subject  to  congestion  delays, 
equilibrium  flows  may  not  be  Pareto  optimal.  The  imposition  of  tolls  has  been  a  much  dis¬ 
cussed  remedy  (see,  for  example,  the  classical  treatments  by  Pigou  [4]  and  Knight  [3]  and  the 
more  recent  discussions  in  Kahn  [2]  and  Edelson  [1]).  The  purpose  of  this  note  is  to  show  that 
an  all-knowing  central  authority,  using  a  system  of  tolls  or  a  system  of  tolls  and  subsidies,  can¬ 
not  in  general  force  an  outcome  which  is  Pareto  optimal  or  even  Pareto  superior  to  a  given 
nonoptimal  equilibrium. 


Consider  a  network  containing  only  two  nodes,  A  and  B,  which  are  connected  by  two  dis¬ 
tinct  arcs,  1  and  2.  Four  players,  1,  2,  3,  and  4,  must  travel  from  A  to  B.  The  cost  (or  travel 
time)  incurred  by  player  /  on  arc  k  when  the  total  traffic  on  arc  k  is  xk  is 


c.*(**) 


2 

4 
3 

5 


if  /  k,  xk  <  2 


if  /  ^  k,  xk  >  2 
if  /  =  k,  xk  <  2 
if  i  =  k,  xk  >  2 


for  /  =  1,2 


=  0  otherwise. 


Thus,  players  1  and  2  incur  uniformly  higher  costs  on  arcs  I  and  2,  respectively,  while  the  costs 
for  players  3  and  4  are  0,  independent  of  route  and  traffic.  It  is  easy  to  see  that  the  Pareto 
optimal  (costs  being  minimized)  route  selections  result  in  the  unique  cost  vector  (2, 2, 0,0)  and 
involve  player  1  travelling  on  arc  2,  player  2  travelling  on  arc  1,  player  3  travelling  on  either 
arc,  and  player  4  travelling  on  the  arc  not  traversed  by  3.  The  set  of  Nash  equilibria  of  the  pure 
strategy  game  of  route  selection,  however,  consists  of  all  the  configurations  of  two  players  on 
each  arc.  The  resulting  cost  vectors  are  (2, 2, 0,0),  (2, 3, 0,0),  (3, 2, 0,0)  and  (3, 3, 0,0). 
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Suppose  that  a  central  authority,  knowing  the  above  data  and  fearing  a  nonoptimal  equili¬ 
brium,  wishes  to  force  a  Pareto  optimal  selection  of  routes  for  the  players.  Consider  first  a  sys¬ 
tem  of  tolls  on  the  arcs  in  the  form  of  some  otherwise  worthless  medium  (say  tokens)  together 
with  subsidies  of  tokens  to  the  players.  (In  our  usage  a  toll  is  a  fixed  charge  which  must  be 
paid  by  every  unit  of  flow  using  a  particular  arc.  A  subsidy  is  a  gift  to  a  player  which  is 
independent  of  route  selection.  Different  players  may  receive  different  subsidies,  and  different 
arcs  may  be  assigned  different  tolls;  but  different  players  pay  the  same  toll  if  they  use  the  same 
arc.)  No  matter  what  the  subsidies  are  (as  long  as  all  players  can  afford  at  least  one  of  the  arcs) 
there  is  no  way  to  stop  players  3  and  4  from  travelling  on  the  arc  with  the  cheaper  toll  (or  on 
the  same  arc,  if  both  tolls  are  equal).  This  cannot  result  in  a  Pareto  optimal  selection. 

Similarly,  if  the  tolls  and  subsidies  are  in  the  form  of  money,  3  and  4  cannot  be  kept  from 
travelling  on  the  cheaper  arc. 

A  trivial  way  out  of  the  dilemma  in  general  is  to  use  a  different  kind  of  token  on  each  arc 
in  the  network  and  to  subsidize  each  player  exactly  the  required  number  of  units  of  each  kind 
of  token  for  the  route  desired.  This  method,  of  course,  can  be  used  to  force  every  player  to 
use  the  route  deemed  desirable  for  him.  It  does  not  appear  to  be  practical  in  large  networks, 
however. 
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ABSTRACT 

The  problem  of  determining  a  vector  that  places  a  system  in  a  state  of 
equilibrium  is  studied  with  the  aid  of  mathematical  programming.  The  ap¬ 
proach  derives  from  the  logical  equivalence  between  the  genera]  equilibrium 
problem  and  the  complementarity  problem,  the  latter  being  explicitly  concerned 
with  finding  a  point  in  the  set  S  -  (x  :  <  x,g(x)>  =  O.g(x)  <  0,  v  £  0)  . 
An  associated  nonconvex  program,  mini-  <  x.glx )  >  :  g(x)  <  0,x  >  0).  is 
proposed  whose  solution  set  coincides  with  S.  When  the  excess  demand  func¬ 
tion  g(x)  meets  certain  separability  conditions,  equilibrium  solutions  are  ob¬ 
tained  by  using  an  established  branch  and  bound  algorithm  Because  the  best 
upper  bound  is  known  at  the  outset,  an  independent  check  for  convergence  can 
be  made  at  each  iteration  of  the  algorithm,  thereby  greatly  increasing  its 
efficiency.  A  number  of  examples  drawn  from  economic  and  network  theory 
are  presented  in  order  to  demonstrate  the  computational  aspects  of  the  ap¬ 
proach.  The  results  appear  promising  for  a  wide  range  of  pro.blem  sizes  and 
types,  with  solutions  occurring  in  a  relatively  small  number  of  iterations. 


1.  INTRODUCTION 

In  this  paper  we  investigate  a  procedure  for  computing  equilibria  from  the  vantage  point 
of  mathe  latical  programming.  A  competitive  model  of  an  ecomony  will  serve  as  the  basis  for 
the  discussion  although  a  variety  of  contexts  would  have  been  equally  suitable.  Other  types  of 
equilibrium  problems,  such  as  those  arising  in  traffic  network  analysis,  have  direct  conceptual 
and  analytic  counterparts  to  those  found  in  economics,  and  are  hence  amenable  to  the  same 
solution  techniques. 

A  state  of  equilibrium  exists  when  competing  or  opposing  forces  are  brought  into  balance. 
One  of  the  major  themes  of  ecomonic  theory  is  that  the  behavior  of  a  complex  economic  sys¬ 
tem  can  be  viewed  as  an  equilibrium  arising  from  the  interaction  of  a  number  of  economic 
units,  each  motivated  by  their  own  special  interest.  General  equilibrium  theory  ([2], [22], [24]) 
seeks  to  determine  the  point  at  which  this  balance  can  be  struck,  and  in  so  doing  focuses  on  the 
interrelationships  that  exist  among  the  markets  for  goods  and  services  in  the  economy.  The 
analysis,  however,  is  carried  out  in  terms  of  individual  decision  makers  and  commodities  rather 
than  in  terms  of  aggregates.  The  fundamental  questions  that  general  equilibrium  theory 
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attempts  to  answer  are  the  same  as  those  posed  in  macroeconomic  theory:  given  different 
economic  environments,  what  goods  will  the  economy  produce,  how  will  these  be  produced, 
and  who  will  obtain  them?  But  where  macroeconomics  provides  answers  in  terms  of  aggre¬ 
gates,  general  equilibrium  theory  provides  answers  in  terms  of  the  individual  consumers,  pro¬ 
ducers,  and  commodities  making  up  these  aggregates. 

Consider,  for  the  moment,  a  model  in  which  m  consumers  are  engaged  in  the  exchange  of 
commodities  which  they  initially  own  and  in  which  production  or  supply  is  ignored.  Suppose 
there  are  n  goods  in  the  economy  and  that  each  of  the  consumer's  preferences  is  represented  by 
a  utility  function.  A  bundle  of  goods  x  is  preferred  to  a  bundle  x'  by  consumer 
i  (i  -  1,  ....  m)  if  and  only  if  u,(x)  >  u,(x')  where  the  utility  function  w,  :  R n  — >  R  is  gen¬ 
erally  assumed  to  be  strictly  concave  and  continuous.  Let  p  €  R"  be  the  vector  of  prices  for 
the  n  goods.  The  demands  of  the  /  th  consumer  are  determined  by  the  solution  to  the  follow¬ 
ing  problem: 

maximize  u,(x) 
subject  to  <p,x>  <  <p,w’> 
x  >0  . 

where  w'€  R"  is  the  initial  wealth  or  resource  endowment  of  the  /  th  consumer,  /  -  1,  ....  m. 
We  shall  assume  that  the  solution  vector  for  this  problem,  d‘(p),  can  be  written  as  a  continuous 
function  of  the  prices  p.  The  individual  trader’s  excess  demand  function  is  d'ip)  —  w' 

(i  —  1 . m)  and  will  be  denoted  by  g'ip).  The  excess  demand  will  be  positive  for  those 

commodities  whose  stock  he  wishes  to  increase  by  exchange  and  negative  for  the  remaining 
items.  If  it  is  assumed  that  all  purchases  are  to  be  financed  solely  by  the  sale  of  assets,  then 
individual  budgetary  constraints  lead  to  the  following  identity: 

0)  P\d'\  (/>)  +  •••  +  P„d‘nip)  -  Piw\  +  ■■■  +  p„w‘n. 

The  market  excess  demand  function  g  :  Rn  —  Rn  is  simply  the  sum  of  the  individual  excess 
demand  functions 

gip)  -  £  id'ip)  -  w'). 

/-i 


An  equilibrium  price  vector  p *  is  one  for  which  all  of  the  market  excess  demands  are  less 
than  or  equal  to  zero  with  a  zero  price  for  any  commodity  whose  excess  demand  is  strictly  less 
than  zero.  This  leads  to  the  formulation  of  the  complementarity  problem  ([8], [17]): 


(2) 

gip )  <  0, 

(3) 

p  >0, 

(4) 

<p,g(p)> 

whose  solution  p*  will  be  the  focal  point  of  this  paper.  Condition  (4),  known  as  the  Walras 
Law  [27],  is  the  aggregated  form  of  Eq.  (1)  and  holds  for  all  price  vectors  p  whether  they  are  in 
equilibrium  or  not. 

We  note  here  that  production  may  easily  be  incorporated  in  this  model  by  either  replacing 
or  augmenting  the  /  th  consumer’s  initial  wealth  w'  by  a  supply  function.  For  individual  /,  this 
function  relates  the  prevailing  market  prices  p  to  the  quantity  of  goods  produced. 


A  number  of  persons,  including  Nash  [20],  Arrow  and  Debreu  [1],  and  Kuhn  [16]  have 
studied  the  existence  problem  of  the  competitive  model  from  the  standpoint  of  combinatorial 
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topology.  The  first  algorithms,  however,  actually  designed  for  computing  economic  equilibria 
were  developed  by  Scarf  [23],  and  were  based  on  a  procedure  for  approximating  a  fixed  point  of 
a  continuous  mapping.  More  recently,  Wilson  [28]  and  Elken  [10]  have  exploited  path 
methods  in  the  pursuit  of  greater  computational  efficiency.  In  a  slightly  different  vein,  Lemke 
[17]  offered  some  constructive  proofs  relating  to  the  existence  of  equilibrium  points  for  bima¬ 
trix  games.  His  work  strongly  suggested  a  computational  scheme  for  models  with  linear  excess 
demand  functions. 

This  paper  presents  an  alternative  procedure  for  computing  equilibria  for  a  class  of  prob¬ 
lems  where  the  excess  demand  function  or  its  logical  equivalent  has  an  explicit  representation 
that  can  be  converted  to  a  separable  form.  Solutions  are  obtained  by  first  recasting  the  com¬ 
plementarity  problem  into  a  nonconvex  minimization  problem  whose  optimal  value  or  best 
upper  bound  is  known  at  the  outset,  and  then  using  Falk’s  [12]  algorithm  to  locate  a  global 
solution.  This  allows  us  to  go  beyond  the  common  linear  formulations  of  an  economy  or  net¬ 
work  (e.g.,  see  Eaves  [8],  Negishi  [21],  or  Asmuth,  Eaves,  and  Peterson  [4])  which,  in  spite  of 
their  outward  simplicity,  must  appeal  to  rather  complicated  algorithms  if  solutions  are  to  be 
obtained. 

The  algorithm  which  we  subsequently  describe  and  use  as  an  alternative  for  solving  Eqs. 
(2-4)  is  based  on  a  branch  and  bound  philosophy,  and  as  such,  computes  a  convergent 
sequence  of  upper  and  lower  bounds  on  the  optimal  value  of  the  problem.  In  our  case,  how¬ 
ever,  because  the  best  upper  bound  on  the  objective  function  is  known  to  be  zero,  the  amount 
of  work  necessary  to  achieve  convergence  is  significantly  reduced.  The  usual  requirement  of 
finding  a  point  that  yields  equality  between  the  best  upper  and  best  lower  bounds  is  replaced  by 
the  simpler  requirement  of  finding  any  point  that  yields  an  objective  value  of  zero. 

In  the  next  section,  the  complementarity  problem  is  reformulated  as  a  nonconvex  minimi¬ 
zation  problem  whose  solution  yields  the  desired  equilibrium  vector.  Next,  the  method  is 
applied  to  a  number  of  sample  problems  and  our  computational  experience  is  detailed.  Here  we 
see  that  the  results  are  obtained  in  a  surprisingly  small  number  of  iterations  of  the  algorithm. 

2.0  REFORMULATION  OF  THE  COMPLEMENTARITY  PROBLEM 

In  the  complementarity  problem  derived  above,  there  is  no  objective  function  to  be 
optimized.  Indeed,  in  many  complex  economic  equilibrium  problems  there  does  not  appear  to 
be  a  "natural"  objective  function  whose  optimization  yields  prices  and  quantities  in  equilibrium 
(see,  e.g.,  Scarf  [24]). 

In  spite  of  this,  consider  the  following  "artificial"  minimization  problem  (P): 

(P)  v*  -  min(-<p,g(p)>  :  g(p)  <  0 ,  p  >0). 

Now  let  p*  be  a  solution  of  the  complementarity  problem  (i.e.,  p *  is  a  vector  of  equilibrium 
prices).  Then  p*  is  feasible  to  problem  (P),  and  yields  a  value  of  0  to  the  objective  function. 
Since  this  objective  function  is  greater  than  or  equal  to  zero  at  all  feasible  points,  v*  «  0.  Con¬ 
versely,  it  is  clear  that  any  solution  of  problem  (P)  for  which  v*  —  0  must  be  a  vector  of  equili¬ 
brium  prices. 

Problem  (P)  is  of  a  nonconvex  nature,  and  in  general,  no  suitable  technique  exists  for  the 
determination  of  a  global,  rather  than  a  local  solution;  however,  if  each  excess  demand  function 
g,,  i  “  1,  2, ....  n,  is  separable,  i.e.. 
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“  L  Sij^Pj).  '-1.2 . n 

y— i 

and  each  is  continuous,  then  problem  (P)  can  be  written  as  a  separable  programming  prob¬ 
lem  whose  approximate  global  solution  can  be  obtained  with  arbitrary  precision. 


We  now  formulate  an  equivalent  problem  with  a  different  objective  function  but  the  same 
constraint  region  whose  optimal  value  is  equal  to  that  of  problem  (P).  The  equivalent  problem 
(P')  is 


(P') 


min, 


X  min(/7,,  -  gj(p))  :  p  >  0,  g{p)  <  Ol 


Rewriting  the  objective  function  in  problem  (P0,  we  get  the  desired  result: 


(S) 


min 

s0>);£0 

p>  0 

—  min 


min 


X (min (0, -&(/>)  -  p,)  +  pj) 


(min(0,»>/)  +  p,) 


g(p)£0 

W  +  p  +  g(p)  —  0 


Wj 


X(min(0,w,)  +  p,) 

/ 

Z  &,  (/>,  )  <  o 

J 

v  +  Pi  +  Z  gijipj)  =  o 


/  —  1 ,  2,  ....  n 


p>  0 


where  the  w,  ’s  will  be  referred  to  as  auxiliary  variables. 


Problem  (S)  is  still  a  nonconvex  programming  problem,  but  its  separable  structure, 
created  at  the  expense  of  a  twofold  increase  in  dimensionality,  makes  its  mathematics  much 
more  tractable.  The  traditional  method  for  treating  separable  problems  involves  calculating 
piecewise  linear  approximations  of  the  associated  functions  and  applying  a  modification  of  the 
simplex  method  to  the  resulting  problem  (see,  e.g..  Miller  [19]).  The  modification  amounts  to 
a  restriction  on  the  usual  manner  of  selecting  variables  to  exchange  roles  (basic  to  nonbasic  and 
vice  versa)  and  will  yield  a  local  but  not  necessarily  a  global  solution  of  the  approximating  prob¬ 
lem. 


An  algorithm  for  finding  global  solutions  of  nonconvex  separable  problems  was  developed 
by  Falk  and  Soland  [13]  and  Soland  [25],  The  method  is  based  on  the  branch  and  bound  philo¬ 
sophy  and  yields  a  (generally  infinite)  sequence  of  points  whose  cluster  points  are  global  solu¬ 
tions  of  the  problem.  The  implementation  of  the  method  is  limited  by  the  necessity  of  comput¬ 
ing  convex  envelopes  111]  of  the  functions  involved,  although  a  number  of  applications  have 
been  shown  possible  when  these  functions  exhibit  special  structures  (e.g.,  concave  or  piecewise 
linear). 

The  inherent  limitations  that  special  problem  structures  impose  have  been  overcome  by 
the  introduction  of  two  algorithms  independently  developed  by  Beale  and  Tomlin  [5]  and  Falk 
[12],  For  this  paper,  we  have  used  the  programming  code  MOGG  based  on  the  algorithm  pro¬ 
posed  by  Falk  and  written  by  Grotte  [151  to  solve  a  number  of  equilibrium  problems.  The 
results  are  presented  in  the  next  section. 
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3.0  COMPUTATIONAL  EXPERIENCE 

A  variety  of  equilibrium  problems  have  been  studied  lo  test  the  approach  outlined  above. 
The  first  is  a  multicommodity,  transshipment  problem  defined  on  an  affine  network,  taken  from 
Asmuth,  Eaves,  and  Peterson  [41  who  used  Lemke's  algorithm  [17]  to  obtain  a  solution.  The 
second  involves  a  simple  competitive  market  comprising  three  producers,  three  consumers,  and 
three  commodities.  The  supply  and  demand  functions  in  this  economy  are  given  a  piecewise 
linear  formulation,  and  three  equilibrium  points  are  known  to  exist.  The  third  problem  is 
identical  to  the  second  except  that  a  majority  of  the  piecewise  linear  functions  have  been  recast 
as  continuous,  smooth  functions.  The  fourth  problem  provides  an  example  outside  the  context 
of  economics,  and  is  derived  from  a  3-node,  4-arc  traffic  network  whose  equivalent  excess 
demand  function  is  both  nonlinear  and  nonseparable. 

The  algorithm  itself  is  based  on  branch  and  bound  techniques  and  considers  subsets  of  a 
linear  polyhedron  containing  the  feasible  region  of  problem  (S).  A  lower  bound  on  the  optimal 
value  of  the  problem  is  found  by  minimizing  the  objective  function  over  each  of  these  subsets 
and  selecting  the  smallest  value  obtained.  A  check  for  the  solution  is  made  which,  if  success¬ 
ful,  yields  a  global  solution  of  the  piecewise  linear  approximation  to  problem  (S).  If  the  check 
fails,  the  subset  corresponding  to  the  smallest  lower  bound  is  further  subdivided  into  either  two 
or  three  new  linear  polyhedra  and  the  process  continues  as  before  with  new  and  sharper  bounds 
being  determined.  The  process  is  finite  and  terminates  with  a  global  solution  of  the  approxi¬ 
mate  problem. 

3.1  Transshipment  on  Affine  Networks 

Economic  equilibria  on  certain  affine,  multicommodity,  transshipment  networks  were  first 
studied  by  the  regional  economists  Takayama  and  Judge  [26]  using  quadratic  programming. 
Recently,  Asmuth,  Eaves,  and  Peterson  [4]  have  constructed  a  more  general  approach  that  util¬ 
izes  the  economic  equilibrium  conditions  directly  without  first  passing  to  a  quadratic  program¬ 
ming  problem.  A  brief  discussion  of  their  model  and  the  solution  to  the  sample  problem 
presented  in  their  paper  follow. 

The  transshipment  problem  can  conveniently  be  represented  by  a  directed  graph  (N,L) 
with  a  finite  number  of  nodes  (members  of  AO  and  links  (members  of  L)  on  which  a  finite 
number  of  commodities  can  be  transported.  Each  node  i  in  N  represents  the  set  of  producers 
and/or  consumers  at  a  specific  spatial  location;  and  each  link  ij  in  L  represents  a  specific  facility 
for  transporting  commodities  from  some  node  /  to  a  different  node  j.  (In  particular,  we  assume 
that  there  are  no  loops;  i.e.,  no  links  connecting  a  given  node  to  itself.)  Each  link  is  aligned  to 
coincide  with  a  direction  of  possible  transport;  therefore,  at  least  two  links  must  connect  nodes 
between  which  commodities  can  be  transported  in  either  direction.  The  nodes  are  enumerated 
in  any  order,  consecutively  beginning  with  one,  as  illustrated  by  the  graph  in  Fig.  1. 

Let  p'd  and  p[  be  n  vectors  denoting  the  unit  demand  price  and  the  unit  supply  price  of  n 
commodities  at  node  /,  and  let  p1'  €  R"  denote  the  cost  of  transporting  each  of  these  n  commo¬ 
dities  over  link  ij.  Affine  relations  are  assumed  between  prices  and  quantities,  i.e., 

(5)  p‘  -  /4'x'+  a'  for  each  node  i  €  N, 

where  p'  -  (pd,p[)T,  A\  and  o'  are  given  constant  matrices  and  vectors  (which  arose  in  [4] 
from  inverting  the  difference  between  given  supply  and  demand  quantities,  originally  expressed 
as  affine  functions  of  price),  and  x'  is  an  n-dimensional  vector  representing  the  excess  quantity 
of  each  commodity  produced  by  node  /.  It  is  also  assumed  that 
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(6)  p'j  —  A‘jx'J  +  a ,J  for  each  link  ij  €  L, 

where  AiJ  and  a‘J  are  given  constant  matrices  and  vectors  (which  arose  from  describing  the 
transport  prices  as  functions  of  transport  volumes),  and  xiJ  €  Rn  denotes  the  quantity  of  n  com¬ 
modities  transported  over  link  ij. 

The  quantities  are  constrained  by  the  nonnegativity  condition 

(7)  xIJ  >  0  for  each  link  ij  6  L 
and  the  commodity  conservation  condition 

(8)  x1  —  ]£  xiJ  —  £  xJI  for  each  node  i  €  N, 

jtN  JIN 

where  x'J  -  0  if  ij  #  L.  Note  that  although  xlj  is  nonnegative  by  virtue  of  the  choice  of  link 
direction,  the  components  of  x1  might  be  positive  or  negative  depending  on  whether  node  i  is  a 
net  exporter  or  net  importer  of  a  particular  commodity. 

The  price  stability  condition  on  p  leads  to  the  following  relationship: 

(9)  p‘J  +  pi  £  Pi  for  each  link  ij  E  L. 

To  see  this,  assume  that  Eq.  (9)  is  violated  for  some  commodity  c.  As  a  consequence  some 
economic  agent  would  find  it  profitable  to  purchase  as  much  of  commodity  c  as  possible  at  node 
/  and  transport  it  over  link  ij  for  resale  at  node  j.  This  would  clearly  be  an  economically 
unstable  situation. 

The  final  relationship  needed  to  establish  equilibrium  is  the  complementarity  condition 

(10)  <  x‘>,  (pJd  -  pij  -  pi)  >  -  0  for  each  link  ij  6  L. 

This  condition  is  imposed  to  ensure  that  no  positive  flow  xiJ  will  occur  on  a  link  if  the  cost 
p'i  +  p'5  of  a  commodity  at  node  j  exceeds  the  price  pi  which  a  consumer  is  willing  to  pay. 

To  conform  with  the  notation  developed  in  Section  1.0,  a  function  g'J  €  R "  equivalent  to 
the  excess  demand  function  but  now  expressed  in  terms  of  prices  rather  than  quantities  will  be 
defined  by  the  following  expression: 

g'1  *Pi~  P°  ~  Pi  for  each  link  ij  €  L. 
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Through  the  appropriate  substitutions  the  solution  ( x,p )  to  the  economic  equilibrium  con¬ 
ditions  (5-10)  can  be  described  entirely  in  terms  of  the  solution  (x,x)  to  t-he  linear  complemen¬ 
tarity  conditions 

(11)  x  >  0  ,  <x,g>  —  0,  g  <  0 

(12)  g  —  —  Mx  -  v 

where  x,  g,  and  v  are  vectors  equal  in  size  to  the  number  of  links  times  the  number  of  commo¬ 
dities,  and  M  is  a  square  matrix  of  comparable  dimension  whose  components  are  given  in  Fig. 
2.  The  constant  v  follows  from  the  substitution  of  Eqs.  (5)  and  (6)  into  Eq.  (10)  and  is  given 
by 

viJ  =  ai  —  aiJ  —  os'  for  each  link  ij  €  L. 


A,  +  A}  +  A’ 

(A,  +  A,) 

-A* 

0 

A? 

v” 

-(A,  +  Aj) 

A,  +  Aj  +  A1 

A? 

0 

■A? 

Vs' 

•A? 

A? 

A2+  A3+  A3 

■A3 

-  Aj 

v” 

0 

0 

-A-, 

A3+  A4  +  A4 

A. 

Aj 

-A, 

•a2 

■A. 

A2  +  A.+  A5 

V’ 

Figure  2.  Constituents  of  matrix  for  sample  network. 


For  purposes  of  illustration,  the  2-commodity,  5-link  network  shown  in  Fig.  1  has  been 
considered  for  analysis.  The  reworked  data  for  this  problem  are  displayed  in  Fig.  3.  If  condi¬ 
tions  (11)  and  (12)  are  now  put  into  the  format  of  problem  (S),  we  get  a  problem  of  the  form 
to 

(A)  min  £  (min(0,w,)  -F  x,) 

*■"  i-i 

w,  —  MjX  -I-  x,  —  v, 

—MjX  <  v,  /  —  1,  2 . 10 

x  >  0 

where  A/,  is  the  /  th  row  of  M. 

The  algorithm  that  is  used  for  the  computations  does  not  solve  the  original  problem  (S), 
but  constructs  an  approximate  problem  to  solve  by  replacing  each  of  the  associated  functions 
with  their  piecewise  linear  convex  envelopes.  A  related  problem  is  simultaneously  introduced 
which  gives  a  sharper  underestimate  of  the  optimal  value  of  the  approximating  problem  than 
does  the  convex  envelope  problem.  It  is  this  related  problem  that  the  branch  and  bound  pro¬ 
cedure  solves  first  to  get  estimates  on  the  optimal  value  of  the  approximating  problem,  and  to 
set  up  new  problems  if  the  estimates  do  not  yield  a  global  solution. 
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Figure  3.  Data  for  affine  network. 


The  functions  defining  the  constraint  region  of  problem  (A)  are  all  linear  and  hence  con¬ 
vex,  and  therefore  will  not  be  replaced  in  the  approximate  problem.  The  functions  associated 
with  the  nonlinear  variables  w,  in  the  objective  function  (i.e.,  min(0,tv,),  i  -  1,  ....  n)  are 
piecewise  linear,  but  concave,  and  will  be  replaced  in  the  approximate  problem  by  their  convex 
envelopes,  which  in  this  case  are  straight  lines.  This  is  illustrated  in  Pig.  4. 


min  (0,w. ) 


Convex  Envelope 


Figure  4.  Convex  envelope  of  mintO.w,). 


The  branch  and  bound  technique  proceeds  to  divide  the  domain  of  these  functions  into 
pieces  corresponding  to  their  linear  segments  and  separately  solves  the  set  of  related  problems 
in  which  the  nonlinear  variables  are  respectively  limited.  When  every  function  is  piecewise 
linear,  as  is  the  case  with  problem  (A),  we  get  an  exact  solution  to  the  original  problem. 

It  is  customary  with  branch  and  bound  methods  to  describe  the  algorithm  in  terms  of  a 
branch  and  bound  tree.  The  nodes  of  the  tree  correspond  to  the  related  linear  subproblems, 
while  the  branches  of  the  tree  correspond  to  the  set  on  which  the  branching  variables  are 
defined.  A  solution  is  obtained  when  the  best  upper  bound  at  any  node  is  equal  to  zero.  That 
is,  any  feasible  point  yielding  a  subproblem  value  of  zero  necessarily  satisfies  the  equilibrium 
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conditions  for  problem  (A),  no  tree  developed  because  the  solution  was  obtained  on  the  first 
iteration  of  the  algorithm  The  optimal  vector  x  is  given  by 

x  •'  -  <0.2353,  0.7059)  x21  -  (0.0,  2.2941)  x23  -  (1.5294,  0.0) 

xu  -  tl  0098,  0.2151)  x42  -  (0.0,  0.0). 

The  tormulation  of  problem  (A)  required  the  addition  of  10  auxiliary  variables  to  the  ori¬ 
ginal  set  of  10  linear  variables  The  former  were  each  divided  into  two  intervals  for  the  pur¬ 
poses  of  branching  This  division,  corresponding  to  the  segments  of  the  piecewise  linear  func¬ 
tions  defined  for  these  variables,  implied  that  any  branch  and  bound  tree  produced  by  the  algo¬ 
rithm  could  be  at  most  10  branches  deep  and  that  no  variable  could  appear  more  than  once 
along  any  path  In  theory,  it  might  have  been  necessary  to  solve  up  to  211  -  1  subproblems 
before  reaching  a  solution,  however,  the  fact  that  the  first  subproblem  produced  an  equilibrium 
point  underscores  the  computational  efficiencies  that  result  from  having  available  at  the  outset  a 
means  of  independently  checking  each  iteration  for  convergence. 

Each  subproblem  solved  by  MOGG  is  a  linear  program.  When  the  excess  demand  func¬ 
tions  are  affine,  the  solution  vector  necessarily  yields  a  feasible  point  to  the  original  problem. 
The  upper  bound  associated  with  this  feasible  point  will  always  be  greater  than  or  equal  to  zero, 
but  generally  not  correspond  to  an  equilibrium  solution.  Mangasarian  [18]  has  shown  that  the 
linear  complementarity  problem  is  equivalent  to  a  linear  program  whose  cost  coefficients  are 
dependent  upon  the  structure  of  M.  The  similarity  between  these  linear  programs  and  those  set 
up  by  MOGG  when  M  meets  certain  conditions  admits  the  possibility  that  MOGG  will  produce 
an  equilibrium  point  on  the  first  iteration.  Although  these  conditions  might  logically  arise  in 
some  economic  contexts,  they  were  not  present  in  this  example  and,  hence,  did  not  influence 
the  rate  of  convergence. 


3.2  A  Piecewise  Linear  Market 

This  example  [7]  provides  a  simple  explanation  of  how  a  competitive  market  operates.  As 
is  common  in  microeconomic  theory,  we  will  distinguish  among  individuals  according  to  the 
economic  functions  that  they  perform  or  on  the  basis  of  the  kinds  of  decisions  they  make. 
Thus,  a  consumer  is  an  individual  (or  unit)  that  consumes  commodities  and  supplies  inputs  to 
production.  The  role  of  the  consumer  may  be  defined  as  that  of  choosing  from  among  the 
alternative  commodity  bundles  available  to  him.  Similarly,  a  producer  is  an  individual  (or 
group)  that  utilizes  inputs  to  produce  commodities.  The  role  of  the  producer  may  be  character¬ 
ized  as  that  of  choosing  from  among  the  alternative  input-output  patterns  available  to  him.  The 
same  individual  might  appear  in  the  economy  both  as  a  consumer  and  as  a  producer.  Once  the 
choices  are  made,  a  state  of  the  economy  is  defined. 

Under  certain  assumptions  (see  Quirk  and  Saposnik  [22]),  for  a  market  that  contains  n 
commodities,  m  consumers,  and  I  producers,  the  aggregated  (net)  amounts  of  commodities 
demanded  and  supplied  for  any  vector  of  prices  can  be  determined  by  a  simple  summation  of 
the  amounts  demanded  and  supplied  by  individual  consumers  and  producers.  Thus,  given  the 
price  vector  p ,  where  p  -  (p\,p2 . p„ ),  we  can  write  xit(p)  as  the  amount  of  the  ith  commo¬ 

dity  consumed  (or  supplied  as  an  input  in  production)  by  the  j  th  individual  at  the  set  of  prices 
given  by  p  ;  and  ylk(p )  as  the  amount  of  the  /  th  commodity  produced  (or  used  up  as  an  input 
in  production)  by  the  k  th  firm  at  the  set  of  prices  given  by  p.  Then,  the  aggregate  (net)  con¬ 
sumption  of  commodity  i  by  consumers  is  given  by 
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Xj(p)  -  X  i  -  1.  2 . n 

J- 1 

and  the  aggregate  (net)  production  of  commodity  i  by  producers  is  given  by 

i 

yt(p)  -  L  yAp')-  i  -  l,  2,  ....  n. 

k- 1 

We  then  define  x(p)  and  y(p)  as  point-to-point  mappings  from  Rn  into  itself.  In  the  absence 
of  any  initial  endowment  the  excess  demand  function  can  be  written  as  g(p)  -  x(p)  -  y(p). 

The  sample  economy  under  consideration  contains  three  commodities,  three  consumers, 
and  three  producers.  The  associated  supply  and  demand  functions  are  assumed  to  be  piecewise 
linear,  and  are  given  in  graphic  form  in  Figs.  5  and  6.  To  conform  with  the  presentation  in  [7], 
the  equilibrium  quantity  rather  than  the  equilibrium  price  will  be  computed.  The  following 
notation  will  be  used: 

ps  —  j  th  producer’s  supply  price  for  commodity  /', 
pdj  —  j  th  consumer’s  demand  price  for  commodity  /, 

x,  —  quantity  of  i  th  commodity  consumed, 

y,  —  quantity  of  /  th  commodity  produced, 

where  /,  J  -  1,  2,  3,  ps  -  X  A.  ‘s  a  function  of  the  consumption  variable  x  and  pd  -  £  pd 

J  '  i 

is  a  function  of  the  production  variable  y. 

An  equilibrium  point  will  exist  if  the  following  conditions  are  satisfied: 

(13)  x  >  0,  y  >  0, 

(14)  x  -  y~  0, 

(15)  Pd  ~  P$  <  0. 

(16)  <x,  pd  -  ps>  =  0, 

where  ps  and  pd  are  the  three-dimensional  market  supply  price  and  market  demand  price  vec¬ 
tors.  The  first  condition  assures  feasibility;  the  second  condition  clears  the  market;  the  third 
condition  assures  price  consistency  by  requiring  the  excess  demand  function  to  be  less  than  or 
equal  to  zero;  and  the  fourth  condition  is  Walras’  Law  and  reflects  the  following  circumstances: 
if  X/,  the  quantity  of  the  i  th  commodity  being  purchased,  was  positive  and  if  the  producers’ 
supply  price  ps  was  greater  than  the  consumers’  demand  price  pd ,  then  the  producers  would  be 
losing  money  and  would  begin  to  lower  y,(—  x,)  to  zero.  Such  a  situation  would  be  economi¬ 
cally  unstable. 

Conditions  (13)  — (16)  can  now  be  written  as  a  minimization  problem  in  the  form  of  prob¬ 
lem  (P): 

min{<-x,  (pd-p,)>) 

Pd~  Ps  <0 
x  >  0 
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In  order  to  recast  this  problem  in  the  form  of  problem  (S),  the  cor  imption  and  production 
data  given  in  Figs.  5  and  6  must  be  aggregated  over  their  respective  agents  to  obtain  the  market 
demand  and  supply  curves  p d  and  ps.  This  has  been  done  for  each  of  the  three  commodities. 

COMMODITY  1: 

j—  2x\  +  18;  X\  <  1 

Pd\  “  {-0.5x,  +  16.5;  x,  >  1  +  (6  ~  3x2)  +  (5  _  2xj) 
f  1  lx,;  X!  <  1 

“  \7.5x,  -  3.5;  x  >  l)  +  (2  +  X2>  +  (2  +  2x3> 

COMMODITY  2: 

pdl  -  (12  -  5x,)  +  (14  -  2x2)  +  (10  -  5xj) 

1 1  2x2  ;  x2  <  1 

^"4  +  {3x2  +  9;x2  >  lj  +  3 

COMMODITY  3: 

— x3  +  8;  x3  <  1 

pdj  =  2  +  (5  -  2x2)  +  — 2.5jc3  +  9.5;  x3  >  1 

|0;  *i  <  1  (5x3;  x3  <  lj 

“  }3x,  -  3;  x,  >  1  +  (3  +  Xj)  +  |0.5x3  +  4.5;  x3  >  lj 

The  minimization  problem  in  its  separable  form  becomes 

(B)  min  £  {min(0,w,)  +  x,) 

*-w  /-! 

w,  +  Pd,  ~  Ps,  +  X,  =  0 

»-  1,2,3 

Pd,  -  Ps,  <  0 
x  >  0 

where  pd  and  are  defined  above. 

Each  of  the  six  variables  in  problem  (B)  is  nonlinear,  the  first  three  (tvi,w2,*v3) 
corresponding  to  the  auxiliary  variables  and  the  second  three  (xi,x2,x3)  to  the  original  problem 
variables.  The  associated  functions  are  piecewise  linear  and  contain  at  most  one  break  point. 
This  means  that  the  branch  and  bound  tree  can  be  at  most  six  nodes  deep  and  that  a  maximum 
of  27  -  1  subproblems  might  have  to  be  solved.  Once  again  though,  the  algorithm  converges 
on  the  first  iteration.  The  computed  best  upper  bound  for  the  first  subproblem  is  zero  and 
hence  the  solution. 

If  the  algorithm  is  permitted  to  run  past  this  point  until  its  usual  termination  conditions 
are  met  a  total  of  19  subproblems  will  be  solved.  Figure  7  depicts  the  resultant  branch  and 
bound  tree  which  serves  to  illustrate  both  the  advantage  of  knowing  the  optimal  value  at  the 
outset  and  the  amount  of  work  required  to  search  for  alternative  solutions.  The  known  results 
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Figure  7.  Branch  and  bound  tree  for  piecewise  linear  market  when  forced  past  solution 
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are  corroborated  at  node  9.2  where  roundoff  errors  have  produced  a  best  upper  bound  of  -0.2 
x  10-4  and  a  best  lower  bound  of  0.1  x  10-5,  a  minor  contradiction.  The  two  numbers  adja¬ 
cent  to  each  node  represent  the  best  upper  and  lower  bounds  for  that  subproblem.  A  bar  in  the 
place  of  the  best  upper  bound  indicates  that  no  corresponding  feasible  point  to  the  approximate 
problem  exits.  The  numbers  along  the  branches  refer  to  the  branching  variables  associated  with 
the  preceding  node,  and  the  +  and  -  signs  indicate  whether  the  particular  auxiliary  variable  was 
permitted  to  range  over  the  set  of  positive  real  numbers  or  negative  real  numbers,  respectively. 
The  bars  appearing  below  the  nodes  indicate  that  either  the  lower  bounds  of  the  associated  sub¬ 
problems  are  all  greater  than  the  current  best  upper  bound  or  that  they  are  infeasible  and, 
therefore,  cannot  contain  the  solution. 

In  terms  of  the  actual  variables,  the  solution  vectors  are  x*  =  (2,1,1)  and  w* 
=  (—2,  — 1,-1).  From  the  equality  constraints  in  problem  (B),  it  can  be  seen  that  x*  +  w*  —  0 
whenever  the  corresponding  excess  demand  functions  are  binding.  By  tracing  the  convergent 
path  backwards  from  node  9.2  to  node  0.0  we  see  that  the  branches  that  fall  along  this  path 
correspond  to  the  nonpositive  orthant  of  w.  The  subscripts  attached  to  the  branching  variables 
in  the  tree  denote  the  (closed)  intervals  over  which  the  original  problem  variables  are  defined 
for  all  subsequent  subproblems. 

3.3  A  General  Market 

The  separable  programming  algorithm  works  by  first  replacing  each  of  the  original  prob¬ 
lem  functions  with  their  piecewise  linear  convex  envelopes,  and  then  creating  a  new  problem  to 
solve  as  an  ultimate  approximation.  From  this  approximate  problem  a  series  of  convex  sub¬ 
problems  issue  that  are  set  up  and  solved  under  the  branch  and  bound  philosophy.  If  the  origi¬ 
nal  functions  are  all  piecewise  linear  (but  not  necessarily  convex),  then  solving  the  aggregate  of 
subproblems  is  tantamount  to  solving  the  original  problem  exactly.  Such  was  the  case  in  both 
the  first  and  second  examples.  In  this  example,  four  of  the  piecewise  linear  functions  in  Fig.  5 
and  6  have  been  replaced  with  smooth  counterparts.  The  new  functions  were  constructed  to 
pass  through  the  points  ’,  )  and  (2,  ),  and  are  given  by 

1-003175.,) 

Pju  =  16.516  e 

pJ}1  =  —0.75  Xj  -  0.25  x3  +  8. 
p,u  -  -3.75  x,2  +  14.75  x„ 
p,n  -  17.31234  log(x20  463  +  1  ). 

Substituting  these  functions  for  the  originals  in  problem  (B)  leads  to  a  new  minimization  prob¬ 
lem  that  can  be  written  as 

3 

(C)  min  £  (min(0,w,)  +  x,) 

subject  to 

w,  +  3.75  X\  —  13.75.ri  +  16.516  e  °  1  —  4x2  —  4x5  =  —  7 

w2  -  5x|  -  17.31234  log(x20463  +  1)  -  x2  -  5x3  =*  -  29 

jo;  X|  <  1  (0.75x2  +  4.25x3  -  8;  x3  <  1 1 

14,3  ~  \3x,— 3;  x,  >  l|  ~  3x2  "  |o.75x32  -  0.2Sx3  -  3.5;  x3  >  if 

3.75  x,2  -  14.75  x,  -I-  16.516  ^<-°03,7S "i'  _  4*2  _  4x,  <  -  7 


-  -  4 
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-  5x,  -  17.31234  logOcf463  +  1)  -  2x2  -  5x3  <  -  29 

0.75x3  +  5.25x3  -  8;  x3  <1 
0.75x3  +  0.75x3  -  3.5;  x3  >  1 


0;  x  <  1 
3x)— 3;xi  >  1 


—  3x2  — 
X[  >  0,  x2  >  0,  x3  >  0. 


<  -  4 


The  number  of  cuts  required  to  approximate  a  piecewise  linear  function  exactly  is  equal  to 
the  number  of  segments  constituting  that  function.  When  the  function  is  smooth,  it  cannot  be 
represented  exactly  by  a  finite  number  of  linear  segments  but  can  be  approximated  with  arbi¬ 
trary  precision  by  increasing  the  number  of  cuts.  In  this  example,  six  cuts  were  made  in  the 
original  problem  variables  (xi,x2,x3)  over  the  closed  interval  [0,3].  Because  the  cuts  were 
evenly  spaced  every  half  integer,  and  the  graphs  of  the  smooth  functions  pass  through  the  solu¬ 
tion  points  of  problem  (B),  it  is  reasonable  to  expect  that  the  solution  to  problem  (C)  would 
coincide  with  one  or  more  of  these  points.  This  indeed  was  the  case:  the  identical  solution 
xt*  —  (2,1,1)  resulted  for  problem  (C).  The  associated  branch  and  bound  tree  is  shown  in 
Fig.  8.  The  algorithm  is  seen  to  have  converged  in  the  tenth  stage  at  node  10.1  after  22  sub¬ 
problems  have  been  solved.  This  contrasts  with  the  first  two  examples  where  the  solution 
occurred  on  the  first  iteration;  however,  in  each  of  these  three  problems,  the  first  feasible  point 
produced  by  the  algorithm  resulted  in  the  solution.  Finally,  we  observe  from  Fig.  8  that  at  the 
tenth  stage,  the  best  upper  bound  and  the  best  lower  bound  are  nominally  equal,  implying  that 
the  general  conditions  for  optimality  have  been  satisfied,  so  the  algorithm  is  terminated.  If  an 
equilibrium  point  had  not  yet  been  found  at  this  stage,  it  would  have  been  reasonable  to  con¬ 
clude  that  none  existed  for  the  given  model.  The  other  two  equilibrium  points  were  not 
uncovered. 

3.4  Network  Traffic  Flow 

The  model  of  the  road  system  considered  here  derives  from  the  notion  that  there  exists  a 
large  community  of  users,  each  of  whom  takes  the  quickest  route  available,  given  the  actions  of 
other  users.  The  number  of  trips  taken  is  assumed  to  depend  on  the  time  required  to  make  a 
trip,  while  the  travel  time  on  a  particular  road  is  assumed  to  depend  on  traffic  volume.  The 
example  that  we  will  investigate  was  studied  by  Asmuth  [3],  who  used  stationary  point  theory 
in  conjunction  with  the  Eaves-Saigal  algorithm  [9]  to  obtain  a  solution.  As  will  be  seen,  the 
traffic  flow  problem  closely  resembles  the  multicommodity  network  presented  in  the  first  exam¬ 
ple. 


To  formulate  the  model,  consider  a  directed  network  (N,A)  with  nodes  /  in  N  and  arcs  ij 
in  A.  For  each  arc  ij,  we  are  given  a  delay  function  which  expresses  travel  time  on  arc  ij  as 
a  function  of  the  traffic  flows  on  the  arcs  of  the  network.  The  travel  time  along  arc  ij  will 
necessarily  depend  on  the  flow  on  that  arc,  but  may  as  well  depend  on  flows  along  other  arcs. 
For  example,  a  two-way  street  could  be  modeled  as  a  pair  of  opposing  arcs  where  the  flow  on 
one  of  the  arcs  might  affect  the  travel  time  on  the  other. 

For  each  pair  of  distinct  nodes  /  and  k  we  are  also  given  a  travel  demand  function  g,k 
which  expresses  demand  for  travel  from  /  to  k  as  a  function  of  travel  times  between  nodes  on 
the  network.  Demand  for  travel  from  >  to  k  will  depend  on  travel  time  from  ;  to  k  as  well  as  on 
travel  times  between  other  pairs  of  nodes;  for  instance,  for  i  to  some  alternate  destination. 

Numerous  solution  procedures  have  been  proposed  for  computing  equilibrium  traffic  flows 
and  travel  limes  on  the  network.  When  /  is  integrable  and  g  has  an  inverse  which  is  integrable, 
the  usual  approach  has  been  to  reformulate  the  equilibrium  problem  as  a  convex  programming 
problem.  These  conditions  will  be  met  if  each  /'  depends  only  on  the  total  flow  along  arc  //, 
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and  each  g,  k  is  monotonically  decreasing  and  depends  only  on  the  travel  time  from  node  /  to 
node  k.  Beckman  (6),  Florian  and  Nguyen  1 14],  and  others  have  addressed  this  problem  under 
comparable  conditions. 

In  practice,  the  demand  and  delay  functions  /  and  g  are  at  best  empirical  fits  and  can  be 
endowed  with  these  or  any  other  restrictions  which  may  seem  useful.  Asmuth’s  approach  does 
not  depend  on  such  restrictions,  but  only  requires  that  the  delay  functions  ftJ  be  positive  on 
each  arc,  that  the  travel  demand  functions  g,k  be  nonnegative  and  bounded  for  each  pair  of 
nodes,  and  that  the  network  be  complete;  that  is,  a  directed  path  must  exist  from  every  node  to 
every  other  node.  From  a  strictly  analytic  point  of  view,  we  will  require  only  that  it  be  possible 
to  put  the  functions  /  and  g  into  a  separable  form.  However,  if  the  model  is  to  accurately 
reflect  the  properties  of  the  system  it  might  be  desirable  to  adopt  the  above  restrictions. 

The  mathematical  conditions  for  a  user  equilibrium  are  presented  below.  The  travel  time 
from  node  i  to  node  k  will  be  written  as  tik  and  the  flow  on  arc  ij  with  destination  k  will  be 
written  as  y„k.  It  will  be  said  that  the  travel  time  vector  t  and  flow  vector  y  are  in  equilibrium 
if  the  following  conditions  hold: 


(17) 

g,.kU)  =  Z  y.j.k  -  Z  y-n.k 

j  j 

i^k,  i.  k(.N 

(18) 

y  >  0 

(19) 

t,.k  <  4  O')  +  t,.k 

i^k,  iji.A,  k€N 

!k,k  =  0 

k€N 

(20) 

>'//.*(40')  +  h.k  -  ”  o 

i&k.  ij€A,  k€N 

(21) 

y„  -  Z  y>ik 

ij  6  A 

k 


Condition  (17)  is  the  conservation-of-flow  equation.  It  says  that  the  traffic  leaving  node  (with 
destination  k  is  the  sum  of  the  traffic  arriving  at  node  i  with  destination  k  and  the  traffic  ori¬ 
ginating  at  /  with  destination  k.  Condition  (18)  says  that  traffic  flows  cannot  be  negative. 

Conditions  (19)  and  (20)  require  that  traffic  flow  be  by  the  fastest  route  available.  In  con¬ 
dition  (19)  we  require  that  tik  not  exceed  the  minimum  travel  time  from  /  to  k,  given  the  flows 
y  on  the  network;  condition  (20)  limits  the  traffic  to  those  routes  which  achieve  this  minimum 
travel  time.  Together,  conditions  (19)  and  (20)  imply  the  principle  of  minimum  travel  time. 
This  says  that  if  any  traffic  flows  from  i  to  k ,  that  is,  if  £  y,l  k  >  0,  then 

/ 

i,.k  -  min  (4 O')  +  tik). 

J 

Equation  (21)  relates  the  basic  flows  to  the  total  arc  flows. 

It  may  be  useful  to  think  of  this  system  as  a  multicommodity  network,  where  all  of  the 
traffic  destined  for  a  particular  node  k  is  a  separate  commodity,  all  of  which  must  be  shipped  to 
node  k  via  the  network.  In  this  way  g,.k(t)  is  the  amount  of  commodity  k  which  must  travel 
from  node  i  to  node  k.  This  trip  will  traverse  a  path  of  arcs  from  i  to  k. 

Conditions  (17-21)  can  be  put  in  the  form  of  problem  (P)  as  follows: 

min  L  Z  <  (-/„(>)  -  'i.k  +  c*)  > 

l  iCV  ,t  C  A 


COMPUTING  tQUILIBRIA  VIA  NONCONVKX  PRtXiRAMMING 


251 


subject  to 


~f,  -  +  'a  <  0 

1  ^  *,  y€/4,  A€/V, 

*a  “  Z  >//.*  +  L  T/,.a  “  0 
/  / 

/  ^  fc.  /,  )c  6  /V, 

-  L  >«.* " 0 

ij  €  +. 

* 


Rewriting  this  problem  in  the  form  of  problem  (S)  we  get 

min  £  I  (minfO.w,,*)  +  y,,.*) 

* „°  ni  a 


subject  to 


_  +  '/.*  +  y>,.k 

-  0 

i&k,  ij€A,  k€N, 

~  fli  ~  '/.*  +  '(.* 

<  0 

i  k.  ij  €  A,  k  6  N, 

X  -Pi/.a  4-  ^  T/;.* 

/  / 

=  0 

i  ^  k,  i,  k€N, 

=  0 

ij  €  +. 

The  following  sample  problem  is  from  [4)  and  is  based  on  the  directed  network  shown  in 
Fig.  9.  Here  N  —  1 1,2. 3)  and  A  =  (12,21,23,31),  where  arcs  12  and  21  represent  a  two-way 
street.  The  delay  functions  are 

/12(y)  -  10  +  e",r'°'  +  1.25  log(y2l +1.0), 

/2l(y)  -  10  +  e**21'10'  +  1.25  log(yl2+l.O), 

fa(y)  -  4  +  e,r«",2,> 

/ji(y)-4  +  /r,l“20’. 

where 


y„  “ 


and  the  travel  demand  functions  are 


*i.2(/) 


*2.1  <*> 


Sl.ill) 


80 

'l.2  +  * 
120 

' 1 . .)  +  1 
40 

'2.1  +  1 
100 

*2.1  +  1 
80 

'2.J+  1 
20 


if  '2.1  >  h.b 
if  '2.1  <  '2  J. 

if  '2. 1  £  '2  + 
if  '2.1  <  '2.3. 


Figure  9  Sample  traffic  network. 

60 

h.  1  +  1  ’ 

100 

h.2  +  1 

When  more  than  one  function  value  is  given  at  a  particular  point  (e.g.,  r2  i  “  1 2.3)-  the 
value  of  g  is  the  average  of  the  two  values.  In  this  case  some  of  the  travelers  from  node  2  will 
go  to  either  1  or  3  depending  on  which  is  closest.  If  the  travel  times  are  equal  then  those  trav¬ 
elers  who  want  to  go  to  either  1  or  3  will  be  divided  between  the  two  destinations. 

In  their  present  form,  the  demand  functions  g2 ,\  and  g2  3  exhibit  an  implicit  dependency 
on  the  travel  times  r2,i  and  *2.3  and  therefore  must  be  made  separable  before  the  equilibrium 
problem  can  be  solved.  Although  this  cannot  be  done  explicitly,  the  desired  result  can  be 
achieved  by  considering  the  following  three  disjoint  partitions  of  t : 

h.\  <  *2.3.  *2.1  “  *2.3-  *2.1  >  *2,3' 

The  mathematical  program  associated  with  each  of  these  partitions  comprises  26  variables  and 
27  constraints.  Of  the  26  variables,  12  are  of  the  type  required  to  achieve  separability  of  the 
functions  while  the  remainder  are  defined  in  the  original  problem  statement. 

The  solution  was  uncovered  in  the  third  partition  at  the  84th  stage  after  168  subproblems 
had  been  solved,  and  once  again,  coincided  with  the  first  feasible  point  found.  The  resultant 
branch  and  bound  tree  is  not  displayed  because  of  its  extensive  length,  but  the  final  computa¬ 
tions  are  highlighted  in  Table  1  along  with  the  results  obtained  by  the  Eaves-Saigal  algorithm. 
The  minor  discrepancies  observed  between  the  variable  and  functional  values  computed  by 
either  method  can  be  attributed  to  the  grid  size  superimposed  on  the  algorithm  and  are,  hence, 


*3.i(')  “ 
*3.2(*)  “ 


r"  '■  atm- 
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Table  1  —  Results  for  Traffic  Network  Problem 


Eaves-Saigal 

Algorithm 

MOGG 

i.k 

ti.k 

‘i.k 

8i.k(t) 

1,2 

19.30 

3.94 

19.45 

3.91 

1,3 

28.43 

4.08 

28.58 

4.06 

2,1 

13.22 

2.81 

13.21 

2.81 

2,3 

9.13 

7.90 

9.13 

7.90 

3,1 

4.09 

11.79 

4.09 

11.79 

3,2 

23.38 

4.10 

23.54 

4.07 

ij.k 

y>t.k  f ij 

yij.k  fij 

12,2 

8.04 

19.30 

7.99 

18.68 

12,3 

4.08 

19.30 

4.06 

18.68 

21,1 

1.15 

13.22 

1.21 

13.21 

21,3 

0.00 

13.22 

0.00 

13.21 

23,1 

1.66 

9.13 

1.60 

8.76 

23,3 

11.97 

9.13 

11.96 

8.76 

31,1 

13.46 

4.09 

13.41 

4.08 

31,2 

4.10 

4.09 

4.08 

4.08 

subject  to  control.  Finer  resolution  is  strictly  a  matter  of  increasing  the  number  of  grid  points 
prescribed  for  the  original  nonlinear  variables  and  solving  a  proportionately  larger  problem. 

4.0  CONCLUSIONS 

The  computation  of  equilibria  plays  a  major  role  in  the  analysis  of  economic  and  transport 
systems.  Whenever  the  equilibrium  problem  can  be  formulated  as  a  set  of  complementarity 
equations,  we  have  shown  for  those  cases  where  the  original  functions  are  implicitly  separable, 
that  nonconvex  programming  can  be  used  to  obtain  a  solution  to  either  problem.  A  general 
algorithm  based  on  branch  and  bound  techniques  was  adapted  to  perform  the  equilibrium  com¬ 
putation.  Unlike  the  usual  nonconvex  program  though,  where  the  solution  is  recognized  only 
when  equality  is  achieved  between  the  best  upper  and  best  lower  bounds,  an  independent  check 
can  be  made  for  the  solution  at  each  iteration  because  the  best  upper  bound  is  known  at  the 
outset.  As  our  computational  experience  demonstrates,  this  enhancement  markedly  increases 
the  efficiency  of  the  algorithm. 

However,  the  fact  that  a  numerical  procedure  will  terminate  with  the  correct  answer  in  a 
finite  number  of  iterations  is  no  guarantee  that  it  will  be  of  any  practical  use.  The  combination 
of  method  and  algorithm  under  study  derives  its  tentative  usefulness  from  the  observation  that 
for  most  problems  investigated,  convergence  occurred  in  a  far  smaller  number  of  iterations 
than  theoretically  possible.  The  results  have  been  especially  encouraging  for  problems  of  larger 
dimensions;  and  in  all  cases,  the  equilibrium  solution  coincided  with  the  first  feasible  point 
found  by  MOGG. 

The  affine  equilibrium  problem  or  linear  complementarity  problem  holds  a  particular 
interest  because  of  its  unique  structure  and  implicit  relationship  to  an  equivalent  linear  pro¬ 
gram.  Because  of  the  similarity  between  the  first  subproblem  set  up  by  MOGG  and  this  linear 
program,  immediate  solutions  are  often  obtainable  from  MOGG  at  little  extra  cost.  In  fact,  the 
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additional  work  required  to  determine  the  equivalent  linear  program,  even  for  relatively  small 
problems,  is  often  more  burdensome  and  more  computationally  expensive  than  permitting 
MOGG  to  run  beyond  its  first  subproblem  to  a  point  of  convergence.  A  further  and  decided 
advantage  of  MOGG  is  that  it  will  solve  all  affine  equilibrium  problems  regardless  of  their 
matrix  structure.  By  contrast,  the  majority  of  alternative  procedures  available  are  limited  in 
their  application  to  a  number  of  special  cases  which  do  not  necessarily  arise  in  practice. 
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ABSTRACT 

We  consider  here  stochastic  linear  programs  with  simple  recourse  when  all 
the  elements  of  the  technology  matrix  and  the  resource  vector  have  certain 
specific  distributions.  The  distributions  considered  are  the  Normal,  Exponential 
and  Erlang.  For  the  first  two  instances  we  extend  the  equivalent  deterministic 
program  to  include  the  variance  of  the  recourse.  Finally,  a  simple  example  is 
given  to  illustrate  the  application  of  the  formulas  for  the  Erlang  case. 


1.  INTRODUCTION 

Recently,  Stancu-Minasian  and  Wets  [11]  published  a  bibliography  of  well  over  700  arti¬ 
cles  dealing  with  various  aspects  of  the  theory  and  applications  of  stochastic  programming.  The 
applications  listed  were  from  such  diverse  areas  as  economic  planning  [12,13]  to  water  storage 
[7,9]  to  more  classical  areas  like  production  planning  [3,10]  and  inventory  [4,8].  Needless  to 
say,  stochastic  programming  has  flourished  considerably  in  the  last  two  decades  and  consider¬ 
able  advances  have  been  made  both  in  theory  and  applications. 

In  an  earlier  paper  [6]  we  provided  formulas  for  some  special  cases  of  stochastic  programs 
with  simple  recourse.  In  this  paper  we  continue  along  the  same  lines  and  present  some  more 
expressions  for  the  Exponential,  Erlang  and  Normal  cases.  The  motivation  for  this  research  is 
to  provide  deterministic  equivalent  nonlinear  programs  for  a  variety  of  cases,  so  that  research¬ 
ers  can  use  the  nonlinear  programs  directly  after  appropriate  parameter  estimation.  The  Normal 
and  the  Erlang  cases  are  particularly  interesting  since  a  large  number  of  unimodal  distributions 
can  be  approximated  by  these  two  distributions. 

2.  STOCHASTIC  PROGRAM  WITH  SIMPLE  RECOURSE 

We  consider  the  following  stochastic  program: 

Min  Z  =  cx  +  E  Q(x,0 

X  ' 

such  that 


Q(x,0  -  ‘/iT+  +  <?2>  . 


‘The  paper  was  written  while  the  author  was  Associate  Professor  of  Management  Science,  Bradley  University .  Peoria 
Illinois.  ___ 
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y+  -  y~  -  Tx  -  p  , 

Ax  -  b.  x  >  0,  y+  >  0.  y~  >  0,  ( 7> ), 

where  A,  b,  and  c  are  (m(  x  n ),  (m,  x  1),  and  (1  x  n)  fixed  arrays,  x  is  the  ( n  x  1)  decision 
vector,  and  qx  and  q 2  are  the  (1  x  m)  unit  penalty  vectors.  We  assume  that  £  =  (pT)  and  p 
and  T  are  random  (m  x  l)  and  {m  x  n)  arrays.  Our  objective  here  is  to  obtain  explicit  formu^ 
las  for  Z  under  specific  assumptions  about  the  random  arrays  p  and  T. 

A  number  of  authors  have  studied  this  problem  and  it  is  well  known  that  if  q]  +  q2  >  0 
the  problem  is  bounded  and  Z  is  convex  in  x  (see  Williams  115]  and  Beale  [1]). 


The  problem  may  be  interpreted  in  two  stages.  At  the  first  stage  we  determine  an 
x  €  K  =  {x\Ax  =  b,x  >0)  and  at  the  second  stage,  after  we  observe  T  and  p,  a  recourse 
(y+  or  y-)  is  uniquely  determined.  The  objective  here  is  to  select  an  x  6  K  which  minimizes 
the  sum  of  the  cost  of  x  at  the  first  stage  and  the  expected  cost  of  the  recourse  at  the  second 
stage.  Q(x,£)  is  the  random  variable  representing  the  cost  of  the  recourse  under  policy  x,  and 
the  expectation  is  taken  with  respect  to  £. 

Ziemba  [16],  Beale  [1],  and  Wets  [14]  give  explicit  formulas  for  Z  but  only  when  p  is 
random.  In  [6]  we  give  formulas  for  Z  when  the  elements  of  T  and  p  are  independent 
exponentials  and  laterally  shifted  exponentials.  In  the  same  paper  we  alscT consider  the  case 
where  the  elements  of  p  and  T  are  independent  Erlangs  with  shape  parameters  equal  to  two.  A 
formula  for  the  distribution  of  V  where  V  -  £x  +  Q(x,  £)  is  also  given  for  the  "independent 
exponentials"  case. 

In  this  paper  in  Section  3  we  consider  the  case  where  the  elements  of  T  and  p  have  a 
multivariate  Normal  distribution  in  Rmin+i)  and  derive  expressions  for  the  expected  value  and 
variance  of  0(x,^).  In  Section  4  we  develop  these  expressions  for  the  case  where  the  elements 
of  T  and  p  are  independent  Exponentials.  Finally,  in  Section  5  we  present  recursive  equations 
to  compute  the  expected  value  of  Q(x,£)  when  the  elements  are  independent  Erlangs. 

3.  MULTIVARIATE  NORMAL  DISTRIBUTION 


We  assume  that  £  =  (p,T)  has  a  multivariate  Normal  distribution  in  Rmir+])  .  Q(x,£) 
may  be  written  explicitly  as 

(1)  Q(x.|)  =  q\(Tx  -  p)+  +  q2(  Tx  -  p)~ 

where 

(  •  T  =  Max(  ■  ,0) 


and 


(  ■  r  =  — Min(  .0). 


Denoting  the  ith  element  of  Tx  -  p  by  Y:  we  have 
(2>  L  =  ±_tux,  -  p, 

i- 1 

where  _t„  and  p  are  the  random  elements  of  the  arrays  T  and  p,  respectively.  Defining  p  , 
and  <r  j  as  the  expected  value  and  variance  of  Y,,  we  have 


Ter 
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(3)  Hy  =  £,*,£(_/,/]  -£[  £,  1 
and 

(4)  =  Var[  p ,)  +  ^x^Varlj,,]  ~  2^/x/Cov[  p,,_r  J  + 

ItXjXkCovU,,.  J_ik^ 

k*  i 

where  E  l  ],  Var  [  ],  and  Cov  [  3  denotes  expected  value,  variance,  and  covariance,  respec¬ 
tively. 

Denoting  the  ratio  of  mean  to  standard  deviation  of  T,  by  a,,  i.e., 

(5)  a,  =  Hr/(Ty, 

it  is  easy  to  show  (see  Appendix  A,  Lemma  1)  that 

(6)  £(F,+]  “  M  y<t>(a,)  +  arY<t>(oti), 

where  d>(  •  )  and  (/>(■)  represent  the  cumulative  distribution  function  and  the  density  function 
of  the  standard  Normal  random-  variable.  Similarly, 

(7)  E[Yy)  =  —  m  d>(— «*,)  +  cr  y  <£(a,). 

Hence,  the  equivalent  deterministic  convex  program  may  be  written  as 

Min  2  =  cx  +  £<7|,[  M  *-<!>(<*,)  +  or  r  0(a,)l  + 

x€K 

m 

(8)  £<72i(  M  r  <t>(  -  a,)  +  <T  y  0(a,)], 

i-i 

K  =  {x\Ax  =  b,  x  >  0}. 

LEMMA:  If  q\  =  <7j  =  <7  problem  (8)  reduces  to 

m 

(9)  Min  Z  -  cx  +  T  <?,  (2m  y  <t»  (a ,)  4-  2  cr  r<j> (a,)  -  Hr) 


PROOF:  Noting  that  d>(-a,)  =  1  -  $(«,),  we  substitute  qu  =  qli  =  q,  in  Eq.  (8).  For 
individuals  with  quadratic  utility  functions,  it  can  be  shown  that  their  expected  utility  is  a  linear 
combination  of  the  expected  return  and  variance  of  return.  Though  a  quadratic  utility  function 
implies  increasing  risk  aversion,  this  is  a  first  attempt  to  explicitly  incorporate  risk  in  the  con¬ 
text  of  stochastic  programming.  In  that  spirit  we  suggest  the  following  stochastic  program: 

(10)  Min  Z1  =  cx  +  £[(?(x,£)l  +  A  Var  (C?(x,£)3, 

xtK  ~  ~ 


where  A  may  be  interpreted  as  the  decisionmaker’s  risk-aversion  factor. 


(11) 


Var(()(x.£))  =  Var 


m  m 

£<7liX+  +  iX.~ 


>-\ 


=  I^VarX+J  +  I^VarX']  + 
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Z,Z>0i/0i,Cov[i:,+.K,+]  + 

/*< 

1,1,92, 92,  Cowlir.Y]-]  + 

/*/ 

2l„Zi9i.Q2jCovirr.U\. 

where  Var  [T,+]  can  be  written  as  (see  Appendix) 

(12)  Varl  K,+]  =  [4>(a,)  +  d>2(<*,)]  + 

(t\  [<£2(a  1  —  a,<f>(a,)  +  <I>(a,)]  + 

2m  y  y  I0(a,)  +  (/>(a,)d>(a,)]. 

Similarly, 

(13)  VartK,']  =  m  f  [<!>(—«,)  +  <t>2(-a,)]  + 

cr  £  [<£2(a()  +  a,<f>(a,)  +  <t>(  —  a,)]  — 

2m  y  cr  y  [<t>(a,)  +  <j>(a,)<b(  —  a,)] 

We  next  focus  on  the  covariance  terms  in  expression  (11).  These  are  all  double  integrals  and 
though  we  cannot  reduce  them  to  simple  closed-form  expressions,  we  give  below  their 
equivalent  forms  ns  single  integrals.  This  should  considerably  reduce  the  computational  effort 
for  evaluating  them. 

Consider  Cov[T,+,  T,+];  it  may  be  written  as 

Cov[T,+  X+]  =  £[T;\r,+]  -  £[T,+]£II,+]. 

Using  Jacobians,  it  can  be  shown  that  the  joint  distribution  of  (T,,  Yt)  is  bivariate  Normal.  In 
the  Appendix  we  provide  an  equivalent  form  for  E[T,+  Yf  J.  This  is  reproduced  below. 

(14)  ElJCf'Yi’]  =  f0  y,ffn(y,)<t>(a0'l))  +  s0>,)<Ma0’())]<//'0'/), 

where 

(15)  a(y,)  =  miyj/siy,) 

Eir,lr,-y,1 

cry, 

(16)  =  M  y  ~  P„— —O'/  -  M  y  ) 

<Ty 

(17)  s(y,)  -  {Var[K,[T(  =  y,J|2 

”  o>(l  -  P },) 

where  p,,  is  the  correlation  between  Y,  and  Y,  and  is  given  by  (see  Lemma  4  in  Appendix  A) 

(18)  p„  =  Co v(T,,  Y,)/<r  )  (r  , 

=  - [L*  Z,**- XrCov(  l,k.  I  lr)  ~ 

CT  y(T  y  ( 
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£*.»r*{Cov (_/,*,  p ,)  +Cov(j,*.  p,) )  +  Cov(£,.  £,)j. 

The  covariance  terms  in  Eq.  (18)  are  obtained  directly  from  the  variance-covariance  matrix  of 
the  multivariate  Normal  distribution.  Similarly, 

CovlK,-.  L  ]  =  e\y;y;]  -  E[Y,  ]E[Yj] 

where  £[  T,"  Y~  ]  is  given  by  (see  Lemma  3  in  Appendix  A  )  the  following  expression: 

(19)  ^  £[y~T“]  =  J  Cv, )<b(-a O',))  -  sO',)</>(aO,))l^£0',)- 

Finally,  we  need  a  formula  for  Cov[T,+  ,  y ,“]  in  expression  (11)  to  evaluate  Var(()(x,  £)): 

cov[y,OF-]  =  £[  y,+ y,-]  -  £[  y,+l£[ y.  l 

where  £[K,+  F“]  is  simplified  in  Lemma  3  in  Appendix  A  and  is  given  by 

(20)  £[y,+ y~]  =  J*o  -T/[wO',)4>(aO',))  +  sO,)  ^(«0;))]^£0'/)- 

Hence  for  any  given  policy  x  €  K,  expression  (11)  can  be  computed  if  £  =  ( p,T )  is  multivari¬ 
ate  Normal  in  /?m<"+1).  ~  ~  ~ 


If  the  elements  of  £  are  correlated,  the  covariance  terms  in  expression  (11)  will  have  to 
be  evaluated  numerically;  however,  we  need  to  perform  only  single  integrals  as  discussed 
above. 


4.  EXPONENTIAL  DISTRIBUTION 


We  consider  next  the  instance  where  the  elements  of  £  -  (p.T)  are  independent 
exponentials.  In  [6]  we  gave  expressions  for  £[(?Gc|)l  for  this  particular  case.  E[Q(x,£)] 
was  obtained  by  first  inverting  the  bilateral  Laplace  transform  of  Y,  to  obtain  its  density  func¬ 
tion  and  then  integrating  directly.  We  repeat  some  of  the  results  here  again  for  ,  *«npleteness 
and  also  present  formulas  for  VarCKx,^)  so  that  the  mathematical  program  (10)  may  be  used. 


Denoting  1/A „  and  1/A ,  as  the  means  of  t_u  and  P <  ar*d  defining  L,k  and  L,  thus: 


(21) 

II 

-4 

A,/ 

A.  ,i. 

X,  H - — 

a 

i*k 

A  ii  A  ,k 

1 

Xk  j 

A, 

x,  xk 

(22) 

£,  —  a  ,n  j 

X, 

/  (A ,  +  A  „  x 

we  have  the  following  expressions  for  f(y,)  and  E[Q(x,£))\ 


(23a) 

(23b) 

123c) 

and 


/O',)  =  L,  exp  (A  ij',) 


n 

= 


A  -  I 


=  (L,  +  Za  £,*)/2 


for  y,  <  0, 
for  y,  >  0, 
for  y,  =  0, 


.'4  i 


£(CH-v/)l  =  ZouEiVl  +  Z<?2, £[/,-], 


,-l 


M>  I  =  Z 0(.v„/Lt)! 


\i 
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(26)  £[y~]  =  (1/A  ,)2L„ 

We  next  consider  Var  Q(x,  f): 

m  m  m 

(27)  Var[<?(x.|)l  -  £9|2  Var  IT,+)  +  I<?22Var[J7J  +  2 2>1/<?2, Cov[  T,+,  Tf). 

/-I  /-I  /«l 

Consider  £|(T,+)2)  and  £[(K,-)21-  We  have 

(28)  £[(r,+)2l  =  5'~y?f(y,)dy, 
which  on  integration  by  parts  gives 

(29)  £[07)2]  =  2£M**A*)3- 

4-1 

Similarly,  again  integrating  by  parts  gives 

(30)  E[(Y~)2\  =  2LJ\}. 

Therefore 

(31)  Var  IT,+]  =  2£  Lik(xk/Klk)3  -  {£[T,+]}2 

t-l 

and 

(32)  Var  [y-]  -  2Z.,/A,3  -  £,2/A,4. 

Since  E[Y*Y~]  -  0,  we  have 

(33)  Cov[_K,+,  Y~\  =  -£(K,+]£f rrl. 

Hence  Eq.  (27)  can  be  computed  for  any  policy  x. 

5.  ERLANG  DISTRIBUTION 

In  [6]  we  presented  an  expression  for  £((?(x,f))  when  all  the  random  variables  were 
independent  Erlangs  with  shape  parameters  equal  to~two.  We  extend  this  result  here  to  the 
case  where  the  shape  parameter  is  any  integer.  Inverting  the  Laplace  transform,  in  this  case  by 
the  method  of  residues,  requires  differentiation  with  respect  to  the  shape  parameter.  We 
present  a  lemma  in  this  section  which  may  be  used  to  obtain  E(Q(x,0)-  Denoting  the  means 
of  J(;  and  p,  by  r,,/ A,-,-  and  rj A,,  respectively,  and  the  bilateral  Laplace  transform  of  Y,  by 
L\  ( s ) ,  we  have 

(34)  Ly(s)  =  (n,!^-  '  /  s+  1{(A,)7(A,-s)''} 

x,  /  *, 

with 

A„ 

Max  —  - —  <  s  <  A. 

>  x, 

For  s  >  c  where 

A„ 

Max - - 

/  x, 

there  is  one  pole  of  order  r,,  and  the  residue  R,  of  exp  (sy,)Z.2  (s)  at  s  =  A,  is  given  by 


(35) 
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i 

i 


(36) 

Hence 

(37) 


1 

d' 

0,-i) 

•  ds'‘~ 1 

/  O',)  -  ~R, 


(  'n/|"r-  /  «  +  ~  exp  {sy,} 


for 


y,  <  o. 

For  5  <  c  there  are  n  poles  each  of  order  r,„  j~  I . *,  and  the  residue  of  the  *lh  pole  (of 

order  rlk)  at  s  =  —  — —  is  given  by 

xk 


(38) 


(39) 


R*  = 


1 

dr‘k~l 

( ''/*_])! 

ds'‘k~ 1 

1 

d'‘k~' 

0,*-i)! 

ds'lk~ 1 

s  + 


n 


x*  I  * 

Xl 


Ly(s)  exp  (sy,) 


fn, 

i*k 


s  + 


{(X,)  '/ (A. ,  -  s)r,J  exp  {sy,] 


Hence, 

(40) 

and 

(41) 


/O',)  =  Z  R,k  for  y,  >  0 

*-i 


/( 0)  -  [  lim  /O,)  +  lim  f{y))/2. 

v,— 0+  —  0“ 


Denoting  the  constants  in  Eqs.  (36)  and  (39)  by  K,  and  Lh  respectively,  i.e., 
(42)  k,  =  (-x,)rn,  x' 

and 
(43) 


L,  -  (x,)rn, 


we  need  to  determine  the  (/•,■_  [)  th-  and  ( /-, A - 1 )  th-order  derivatives  of  G,  and  Hn  respectively, 
where 


(44) 

and 

(45) 


G,  =  K,  exp  (syjfl,  s  +  — 


H ,  =  L,  exp  {sy,(n  ,  (s  +  X„)  r"(s  -  X,)  \ 

i*k 


We  present  a  theorem  which  will  enable  us  to  systematically  determine  the  higher  order  deriva¬ 
tives  of  these  two  functions. 


THEOREM:  The  wth-order  derivative  of  the  product  of  n  differentiable  functions  is  the 
wth-order  binomial  expansion  of  their  sum,  with  the  terms  in  the  expansion  interpreted  as 
derivatives  as  shown  below;  i.e.,  given  n  differentiable  functions  /,(s),  j  —  1 . n. 
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where 


f/(s)  -  ~(f,-(s))  for  /  ■»  1,  ...  ,  m, 
ds 


fi(s) 


for  i  -  0. 


PROOF:  The  proof  is  straightforward  and  uses  the  results  for  the  derivative  of  a  product 
of  functions.  For  example,  for  n  -  3  and  m  —  2  we  have 

4y(/i (*)  •  /2(s)  •  /j<s)l  -  c/,(s)  +  /,(s)  +  /3(s))2 
ds 1 

(46)  -  /,2  (s)/2°  (s)/3°  (s)  +  /?  (s)/|  (s)/3°  (s)  +  /f  (s)/2°  (s)/}  (s)  + 

2/'  (s)/j  (s)/3°  (s)  +  2//  (s)/f  (s)/3'  (s)  +  2/P  (s)/2'  (s)f]  (s). 


Note  that  to  determine  the  wth-order  derivative  of  the  product  of  functions  we  need  all  higher 
derivatives  up  to  and  including  m.  Therefore,  if  in  expression  (44)  we  denote  exp  {syj  by 

s  +  —  |  by  /2(s),  we  have  for  any  integer  m  >  0 


/(s)  and  n, 

(47) 

Denoting  Is  + 


— /i(s)  =  y™  exp  !sv,}. 
ds 


by  gj(s)  we  have 


dm 

~(As))  -  ~- 
ds  ds 

=  d'n 
dsm 


n*  + 

/-i 

n*/(*)|> 

/-i 


which  according  to  the  above  theorem  is 
(48)  £g,(s) 

/-i 

which  can  be  expanded  in  a  manner  similar  to  Eq.  (46).  Note  that 
g,°0)  =  g,0) 
and 


(49) 


ge(s)  = 

ds 


II- 0 


s  4- 


for  e  =  1 . m. 


If  m  =  r,_),  we  can  compute  all  the  higher  order  derivatives  required  in  expression  (36), 
and  /O',),  y,  <  0  can  be  computed.  The  partial  expectation  of  Y,  for  y,  <  0  can  then  be  com¬ 
puted  by  direct  integration.  Similarly,  in  expression  (45)  we  define  /((s)  =  exp  («>■,),  /2(s)  = 
[J,  (s  4-  \„)  and  f3(s)  =  (s  -A,)  \ 

i*k 
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Equations  (47)  through  (49)  go  through  again  for  the  higher  order  derivatives  with  a 
slight  modification  of  expression  (48);  namely,  j  cannot  equal  k  in  expression  (48).  Denoting 
this  modified  expression  as  (48 ')  we  have 


(48') 


ds" 


:</(s)) 


i>,(s) 

/-i 


and 


ds' 


-(/,  (s)) 


(50) 


—is  -  a y 

dsm 

in  I 

«-0 


(s  -  A ,)  r 


Note  in  Eq.  (39)  m  =  r,4_,.  Therefore  Rlk  and  hence  /O',),  y,  >  0  can  be  obtained.  The  par¬ 
tial  expected  value  of  K,  for  y,  >  0  is  then  obtained  by  direct  integration. 


EXAMPLE;  We  consider  a  problem  where  T  is  (2  x  2)  with  each  element  of  T  and 
having  an  independent  Erlang  distribution.  Denoting  R  as  the  matrix  of  shape  parameters  of 
and  h  as  the  vector  of  shape  parameters  of  p  we  assume 


R  = 


2  3 

3  4 


and  h  = 


Note  that  the  (i,j) th  element  of  R  is  r{,  and  the  Ah  element  of  h  is  r,.  Using  Eqs.  (36)  through 
(49)  it  can  be  shown  that 


f(y\)  =  exp  {A^^tL^xJy!2  +  L2(x)yt  +  L3(x)]  for  y,  <  0 


and 


/(y,)  =  exp 


A|2 

— —  >”i 


*2 


[A/t(x)y2  +  Mj(x)y,  +  A/3(x)]  + 


exp  - 


-yi 


[M^x)y\  +  4/5(x) ]  for  y\  >  0. 


Hence, 


£|rr,,6C«  -2i2«  CM 


and 


A,3 


A 2 


E[Yf] 


6  M\(x)  2M2(x)  M}(x)  2A/4(x)  A/5(x) 


A|2 

4  +  AI2 

T  +  ' 

*  12 

T  +  “ 

A  ii 

T  +  ‘ 

^  II 

*7 

x2 

Xl 

x\ 

X I 

Similarly, 

/(y2)  =  exp  {A2y2l(/>i(x)y  t  +  P2(x)) 

and 


fory2  <  0 


/(y2)  -  exp  ( - — y2)[ Ai(x)y23  +  A2(x)y22  +  A3(x)y2  4-  /V4(x)]  + 

x2 


exp  ( - ~y2H/V5(x)y2  +  /V6(x)y2  +  A7(x)] 


for  y2  >  0. 
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Therefore, 


and 


£[T2+] 


£[K2-]  - 


2  />,(*)  P2(x) 

X  2  +  X22 


24A'IU) 


6  N2(x)  2N2(x) 
1  ,  U  +  /  .  >3 


*22 

4  ' 

* 22 

•*2 

-*2 

jV4(x) 


l  *2 


6Ns(x) 

Ail  4 

*1 


22V6(jc)  Ar7U) 
r ,  >3  +  i :  >3 


*21 

T  f 

*21 

2 

The  functions  £,(*),  A/,(jr),  A^(jc),  and  P,„(x)  for  /  =  1.  2,  3,  y  =  1 . 5,  k  =  1 . 7, 

and  m  -  1,  2  are  reported  in  Appendix  B.  Since  all  the  partial  expected  values  are  available, 
Q(x,  £)  can  be  computed. 


4.  SUMMARY 

We  presented  here  equivalent  deterministic  convex  programs  for  certain  special  instances 
of  stochastic  programs  with  simple  recourse.  The  problem  studied  had  all  the  elements  of  the 
technology  matrix  and  the  recource  vector  as  random  variables.  Previous  studies  (except  Han- 
sotia  [6])  reported  explicit  formulas  for  cases  where  only  the  resourse  vector  was  considered 
random.  The  specific  cases  studied  here  had  all  the  random  variables  generated  by  a  multivari¬ 
ate  Normal  distribution,  independent  Exponential  distributions,  and  independent  Erlang  distri¬ 
butions.  For  the  first  two  cases  we  also  extended  the  equivalent  deterministic  program  to 
include  the  variance  of  the  recourse.  Finally,  a  simple  example  was  given  to  illustrate  the  use 
of  the  "not-so-simple"  formulas  for  the  Erlang  case. 
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APPENDIX  A 

LEMMA  1:  If  T,  —  N(fiy ,  a- y)  and  ny/try  =  a  „  then 
E [_F,+]  =  M  y t  <t>  (<*/)  +  cr  y  (f>  (a,) 
and 

F[Xrl  =  —fly  ®  (— a j)  +  cr  Y  </>  (a,) , 

where  <£(•)  and  <J>()  are  the  density  and  the  cumulative  distribution  functions  of  the  standard 
Normal  random  variable. 

PROOF: 


(Al) 


exp 


-y  Kj'  ~  py)I<ty\1 


dy . 


Substituting  (y  -  (j.Y  )/aY  —  z  we  have  the  right-hand  side  (RHS)  of  Eq.  (Al)  equal  to 


(A2) 


\I2tt 


t  f-a  ^xp|-|z2 


dz  + 


f-  1 

7^J-«,expf  V 


dz . 


The  second  expression  in  Eq.  (A2)  is  merely  -  <1>(— a()],  which  may  also  be  written  as 
M  y:  4>(a,).  Substituting  v  for  -yz2  in  the  first  integral,  it  becomes 


(T  y  r«  ^  j 

rJ,  !e'Vrfv  =  °’L^exp 


■J2n 


\[2tt 

=  C T  y  <£(«,)  . 


■ia'2 


Hence  £1  is  given  by 

fly  <t>  (a,)  +  (T  y  . 
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To  prove  the  second  result  we  note  that 


(A3)  jcr  -  i-xr 

Denoting  - by  W  and  the  mean  and  standard  deviation  of  fV  by  fi  w  and  <r  w,  we  have 
E[W^\  —  jiw<t>(a)  +  cr  (a ) 


where  a  “  n  wt<r  w ■  Noting  that  nw  “  — Mx  ant*  a  w  —  &y  ,  we  have 

—  —  —  /  ~  ~ , 

E[Y~\  —  —  fi  k.<1> ( —  a,)  +  (Ty  <t>(cti) . 


Q.E.D. 


Note  how  similar  E [  F,+ ]  and  E[Y~]  are.  To  obtain  the  latter  all  we  need  to  do  is  replace  ny 
by  ~n  y  and  -a,  by  a,.  Also  note  that  because  of  symmetry  (—a,)  =  <£(<*,). 


LEMMA  2:  If  Yt  ~  M/x  y  t  cr  y)  and  a,  ■=  /x  y  Jcr  y  ,  then, 

VarlYj]  —  (iY  .2f4>(a,)  +  <J>2(a,)]  +  o-y2[^2(a,)  —  «,$(«,)  4-  <J>  (<*/)]  + 
y  cr  >*.[<£  (a,)  +  ^(a, ■)<&(«/)] 

Var[Tf]  -  fiy, 2 [<&(— a,)  +  4>2( — ae<)]  +  cry.2t02(a,)  +  a,0(a,)  +  <J>(-a,)]  — 

^Ii(TIi^(a')  +  («,)<!»( — C5£,)]  . 


PROOF: 

(A4)  Var  [Y?]  =  £[  F,+)2]  -  {£(  K,+])2 

(A5)  £[(F,+)2]  -  J0"  y2dfy(yi) 


Substituting  z  —  (y  -  /x  k ()/ct  r  in  Eq.  (A4),  we  have  the  RHS  of  Eq.  (A4)  equal  to 
/-*,  z2exp(-y  r2  rfz  +  2Mr<rr  2  exp  j-  y  z2J</z  + 

Mr,2  J*_a  exp  -y  z2j</z] . 

J"_a  z  exp|-y  z2|dz  is  merely  0(a,)  (see  Lemma  1)  and 

=  exp|-yz2|^2  -  1  -  <!>(-«,)  -  ‘P (a ,) . 

/_a  z2  expj-  y  z2Jrfz.  Integrating  by  parts  and  noting 


Hence,  let  us  consider  / 


'Jin 


that 


; 


z  exp 


-lz2 


dz  -  -  exp{-  y  z2|  (up  to  a  constant), 
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we  have  /  equal  to 


-  -  a, <6  (a,)  +  1  -  $(— a*) 


-  -  a, <£(«,)  +  <l>(a,)  • 

Hence  £[(T,+)2]  is  given  by 

a  1, (-«,«(«/)  +  0(a,)]  +2ftY<rrMoti)  +fiy2i>(ai).  Q.E.D. 

Substituting  £[T,+]  (from  Lemma  1)  into  (A5)  and  using  the  above  result  we  have 
Var[K,+]  -  /i|  [<t>(a,)  +  <t>2(a,)]  +  or  y  2[<jb2(a,)  -  a,d>(a,)  4-  $(a,)]  + 

(A6)  2 fiyv  y  \<f»(ai)  +  <£(a,)<t>(a,)] . 

Q.E.D. 

The  second  result  is  obtained  quite  easily  by  substituting  into  Eq.  (A6)  -p  r  for  pY  and  -a, 
fora,  and  noting  that  <£(-«,)  =  <£(«,)• 


LEMMA  3:  If  (Y,,  Yj)  is  Normal  in  R3,  then 

(i)  £[JjX+1  “  J0  Yj [w  (^/)4> (a (}>;))  +  s(yj)<t>(a(yj))]dF(yj), 

(ii)  E[YjY~]  «  §  ^  yjlm(yj)Q(—ct(yj))  -  s(yj)4>(a(.yj))]dF(.yj) , 
and 

(hi)  E[YfY~]  -  J*o  -  yj[m(yj)<b{a{yj))  +  s(yj)<t>(a(yj))]dF(yj) , 

where 

where  m(yj)  =  £[T,|  Yj  =  yj ], 

X 

s(x,)  =  (Var[T,|JK,  —  j>y]) 2 ,  and 

a(yj)  =  m(yj)/s(yj ) . 

PROOF: 

(i)  E  [  Y?  Xt  1  -  J*0~  J0~  W,  dFiyijj) 

(A7)  - /0  fQ  y,y,dF  {yi\yj)dF(yj) . 

But  is  Normal  in  /?  with  mean  m  O',)  and  standard  deviation  s  O',).  where 

(A8)  trr 

m  ty)  —  fly  —  Pfj— (y ■  -  p  Y ) 

'  C/  y/  —  i 
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and 

(A9)  j(yy)  -  <ry  (1  -  p}j) 

with  Pa  being  the  correlation  between  F,  and  Fy.  Equation  (A7)  may  be  written  as 

X  y\So  dF(-y,)  -  X  yjE[Y*\Zi- yj\ dF(yj)- 

Using  Lemma  1,  we  have 

-  yj\  -  /n(yy)tj>(a(yy))  +  s0’y)d>(a0'y)) , 

where 

( A10)  a  (yy)  -  m  ( y,)ls  (yy) . 

Hence  E  (F,+ F,+)  is  given  by 

J*Q  yt  [m  (yj)<t>(a  (yy)  +  s  (yy)d> (a  (yy) )]dF(yj) . 

(ii)  In  an  identical  fashion  we  prove  the  second  result: 

EIY-YJ]  -  fl  fl  yiyj  dF(yi\yJ)  dF(yj)  =  -  yfE[Y~\Y)  -  y,i  dF(y,) . 

Using  Lemma  1  we  have 

ElY~\Yj  “  >"/]  ”  —  m Cvy) (—a (yy) )  +  j(yy)0(a(yy) 

where  m  (y;)  —  E  (F,|  Fy  —  yj)  and  is  given  by  expression  (A8).  s  (yj)  -  {Var  [F,|  F,  - 
and  is  given  by  expression  (A9)  and  «Cvy)  -  m  iy^/siy,).  Hence 

E\Y~Yj)  -  f  ^  yj[m  (y^Qi-aiy,))  -  s0'y)<£(a0'y))]dF0'y) 

Finally, 

eiu  .Hi  -  X 1 X” " 

-  Xl  -  y>  [X  yidF<yi\yi)]dF(y,) 

-  J*  „  -  (>’;)4,(a0'y))  +  s(y,)(b(a O',))]  dF(y/) 

C 

Note  that  p,,  =■  Cov  ( Yt,Yj)/cr  Y{  <r  Y  ;  we  present  next  a  Lemma  to  compute  Cov  {Y,,Yj) 

LEMMA  4:  If  F,  =  I*  J_lk  xk  -  p,  and  F,  -  I*_0*  x*  -  py, 

Cov(  F,,  Fy)  *  E*  XfcXr  Cov(_/ )y;ir ,,.)  E*  x*  Cov  (_/, •*,,£,) 

I*  X*  Cov (_/,*,£,)  +  Cov(p,,p,). 

PROOF:  Consider 

(AH)  Var[F,  +  F,]  -  Var[F,]  +  Var  ( F,]  +  2Cov(F,,  Yj) 

and 

(A  1 2)  Var  I  y,  -  F,]  -  Var[F,l  +  Var[F,l  -  2Cov(F„  F,) . 
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Taking  the  difference  between  expression  (All)  and  (A12),  we  have 

(A13)  VarlT,  +  Yj]  -  VarlT,  -  Yj]  -  4Cov(Kj(  T,) . 

But, 

VarlT,  +  T,]  ■  L*x2(Var(_r,k)  +  Var(_fy*))  +  Var (p,)  +  Var (p,)  + 
x^xr Covl^r _r ,f)  2T^xJicCov(_r^,  p,)  + 

r*k  ~ 

X*  I,  xkxrCo\(j_jk,Sjr)  ~  21kxkCo \UJk,  Pj )  + 

1A14J  n *k  ~ 

2 X Kf Xr x„xfCov(_r jk, ,r)  2 T.kxk Cov(_/ ,,, ,  p,) 

21kxkCov(j_(k,  pj)  +  2Cov(p,,  py) . 

Similarly, 

VarlT,  -  Tyl  -  X*x*(Var(_f,*)  -  Var(_/Jk))  +  Var(p,)  +  Var (p,)  + 

I*  I,  x*xrCovCr -  2'LkxkCov(_lik,p  J  + 

ra<5*  ~ 

(AlS)  2-/c  xkxrCo\{j_jktJ^jr)  2Y.k  xkCo\(_t_jk,  p  j) 

2T*  ^rxAXrCov(_/, *,_/,/)  +  2  X*  x*  Cov  ,  p ,)  + 

2'LkxkQosUik,  p_t)  -  2Cov(p,,  p^j) . 

Substituting  expressions  (A14)  and  (A15)  into  expression  (A13)  and  simplifying,  we  have 
Cov(T„  Yj)  =*  Irxkx, Cov(_r(7c,_r/r)  -  X*  x*  Co  v  p,)  -  I.k xkCov(_iik,  p,)  +  Co v(p,.p,). 
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The  following  functions  appear  in  the  example  in  Section  3. 
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ABSTRACT 

The  primary  goal  of  ihis  paper  is  lo  establish  properties  of  the  inventory 
and  advertising  policy  minimizing  the  expected  discounted  cost  over  a  finite 
horizon  in  a  dynamic  nonstaiionary  inventory  model  with  random  demand 
which  is  influenced  by  the  level  of  goodwill.  Under  linearization  of  the  cost  as¬ 
sociated  with  the  maximum  inventory  and  the  advertising  effect  on  demand, 
the  model  is  shown  to  be  equivalent  to  an  inventory  model  with  disposal. 

Many  results  of  this  paper  are  extended  lo  cover  convex  ordering  cost  of  in¬ 
ventory  and  time  lag  in  delivery  of  stocks. 

1.  INTRODUCTION 

The  problem  of  optimal  inventory  management  when  the  demand  is  partially  controllable 
is  the  main  focus  of  this  paper.  Few  authors  have  attempted  to  analyze  traditional  inventory 
control  models  where  the  level  of  demand  is  a  choice  variable;  generally,  these  attempts  were 
concentrated  on  models  where  the  demand  is  a  function  of  the  inventory  level.  One  way  to 
model  the  control  one  can  exercise  on  the  demand  is  to  specify  that  the  demand  is  an  increas¬ 
ing  function  of  advertising  expenditures.  Along  those  line,  even  fewer  authors  have  attempted 
to  optimally  integrate  inventory  policy  with  advertising  policy.  In  fact,  the  only  reference  we 
know  of  is  Miercourt  [9].  In  this  paper,  we  shall  integrate  them  in  the  context  of  a  discrete 
time  dynamic  inventory  model  involving  a  single  commodity  with  random  demand,  which 
depends  on  the  level  of  advertising.  Under  certain  restrictions,  the  solution  to  the  inventory 
problem  with  advertising  is  equivalent  to  Fukuda’s  {4}  solution  to  the  inventory  model  with 
disposal.  Also  Veinott  [20]  and  Ignall  and  Veinott  [5]  have  established  the  existence  and  the 
characterization  of  the  optimal  policy  for  a  dynamic,  nonstationary,  muitiproduct  inventory,  of 
which  many  of  our  results  in  Section  3  could  be  regarded  as  applications.  Topkis’  [17]  and 
Veinott’s  [23]  work  on  subadditive  functions  on  sublattices  and  its  application  to  inventory 
problems  by  Veinott  [22]  provides  us  with  the  proper  tools  and  methods  for  our  analysis.  A 
knowledge  of  their  theory,  at  least  to  the  extent  of  the  Appendix,  is  essential  to  understand  this 
paper.  An  abundant  literature  (see  Balcer  [2])  covers  the  problem  of  optimizing  advertisement 
expenditures,  given  a  known  demand-advertising  relationship  and  no  inventories.  Most  of  it 
generalizes  the  model  of  Arrow  and  Nerlove  [1],  which  introduces  the  concept  of  goodwill 
increasing  with  advertising  and  decaying  exponentially  over  time. 

The  problem  faced  by  the  manager  of  a  store  with  important  inventory  costs  and  an 
advertising  budget  is  the  theme  of  this  paper.  These  problems  are  important  because  not  only 
can  jointly  managed  advertisement  and  inventory  policies  reduce  costs,  but  such  policies. 
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independently  managed,  may  very  well  reduce  profits,  well  below  the  level  obtained  in  the 
absence  of  advertisement.  Though  the  present  model  is  developed  on  the  theme  of  a  store 
manager  in  charge  of  advertising  and  inventory,  the  basic  equation  could  just  as  well  be  reinter¬ 
preted  as  trying  to  maintain  the  inventory  at  a  positive  level,  like  maintaining  a  work  force  or  a 
number  of  working  pieces  of  equipment  at  a  prescribed  level  (let  S  be  convex  with  a  minimum 
at  the  prescribed  level).  In  the  example  of  the  work  force,  the  demand  is  simply  the  random 
number  of  individuals  who  will  quit  minus  gib ),  the  number  prevented  from  doing  so  or 
rehired,  as  compared  to  the  number  of  new  workers,  y  —  x,  obtained  outside  of  the  organiza¬ 
tion. 

In  Section  2,  we  set  up  an  inventory  model  with  advertisement.  In  the  next  section,  we 
present  and  interpret  conditions  for  the  existence  of  an  optimal  solution.  In  Section  4,  the 
optimal  solution  is  characterized  and  shown  to  be  nondecreasing  in  the  initial  inventory  and 
advertisement.  In  Section  5,  results  on  the  monotonicity  of  the  optimal  solution  are  extended. 
In  Section  6,  the  problem  is  reduced  to  an  inventory  model  with  generalized  disposal.  By  gen¬ 
eralized  disposal,  we  mean  that  purchasing  and  disposing  of  inventory  could  both  take  place 
concurrently  at  the  beginning  of  every  period.  When  advertisement  and  ordering  costs  are 
linear,  this  is  equivalent  to  the  usual  disposal  problem  discussed  by  Fukuda  [4], 

In  Section  7  many  results  of  Section  4  are  extended  to  a  model  where  convex  inventory 
ordering  cost  is  allowed.  In  Section  8,  under  additional  assumptions,  the  model  with  lag  in 
delivery  and  promotion  is  shown  to  have  the  same  properties  as  the  model  with  no  lag. 

2.  MODEL 

In  this  paper,  we  will  study  a  discrete  time  dynamic  model  of  single  commodity  manage¬ 
ment,  when  the  nonnegative  demand  for  the  commodity  in  each  period  is  uncertain,  but  has  a 
known  distribution  depending  on  the  existing  goodwill.  At  the  beginning  of  each  period,  the 
manager  knows  the  initial  inventory  x  and  goodwill  a  >  0,  the  present  and  future  demand  dis¬ 
tributions,  and  the  cost  structure.  He  decides  to  instantaneously  increase  the  initial  inventory  x 
to  a  level  y  ^  x  by  ordering  at  a  unit  cost  c,  and  the  initial  goodwill  a  to  a  level  b  ^  a  by 
advertising  at  a  unit  cost  p.  The  random  demand  gib)  +  U  depends  on  the  goodwill  and  a 
nonnegative  random  number  U.  We  assume  that  g(0)  =  0  without  loss  of  generality  since  a 
constant  can  always  be  added  to  U.  During  this  period,  the  demand  gib)  +  U  occurs,  so  the 
terminal  inventory  becomes  y  —  gib)  —  U  and  the  terminal  goodwill  is  unchanged.  The  initial 
inventory  and  goodwill  in  the  next  period  are  respectively  tj(v  —  gib  )  -  U]  and  6b,  9  ^  0  and 
■qiz)  =  Tj+z+  —  r )_z",  where  tj+  >  tj_  ^  0,  z+  =  max  iz,  0),  and  z~  =  max  (— z,  0).  When  the 
slope  of  r)  is  between  zero  and  one,  (1  —  tj+)  is  interpreted  as  a  depletion  or  loss  of  inventory 
and  (1  —  t)_)  as  a  loss  of  sales  arising  from  the  impatient  consumers  who  depart  before  receiv¬ 
ing  their  orders.  Values  of  9  less  than  one  correspond  to  depletion  of  consumer  goodwill.  This 
assumption  corresponds  to  Zielske’s  [24]  finding  that  the  goodwill  declines  almost  exponentially 
over  time,  and  to  Tull’s  [18]  evidence  on  the  carryover  effect  of  advertisement.  We  introduce 
an  additional  concept,  the  book  value  of  the  terminal  inventory,  y  -  gib)  -  U ,  and  of  the  ter¬ 
minal  goodwill,  b ,  denoted  Hj'  —  gib)  —  U.b).  The  function  Tiz.b)  is  equal  to 
— Ac’rjfz)  —  Kp'9  b,  where  X  ^  0  is  the  discount  factor  and  the  primes  indicate  the  cost  func¬ 
tions  associated  with  the  following  period.  If  z,  the  present  terminal  stock,  is  positive,  the  next 
period  initial  stock  is  t),:,  and  the  manager  would  pay  the  then  discounted  price  Xc't).z  to  reach 
inventory  level  tj+z,  if  no  carryover  of  inventory  were  permissible  from  one  period  to  the  next 
If  z  is  negative,  the  next  period  initial  stock  is  tj  z,  and  the  manager  would  pay  the  then 
discounted  price  AcV*  to  satisfy  the  backlogged  demand  completely.  The  next  period  initial 
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goodwill  is  Ob,  and  the  manager  would  pay  the  then  discounted  price  Kp'Ob  to  reach  that 
goodwill  level,  if  no  carryover  of  goodwill  were  possible  from  one  period  to  the  next.  The  book 
value  of  the  last  period  terminal  inventory  and  goodwill  is  called  the  terminal  cost. 

During  each  period,  the  manager  incurs  a  capacity  cost  h  on  the  starting  inventory  and  a 
shortage-holding  cost  s  on  the  terminal  inventory.  Both  of  these  cost  functions  are  convex,  and 
the  function  s  increases  to  infinity  with  its  argument.  Moreover,  s(z)  is  nondecreasing  in  z  on 
the  nonnegative  real  line.  The  function  h  is  bounded  below.  There  is  a  unit  selling  price  r. 
The  current  sale  price  is  paid  when  each  consumer  demand  is  incurred.  This  yields  a  gross 
revenue  of  r[g(b)  +  U)  to  the  manager.  If  the  inventory  is  positive,  the  consumer  receives 
the  commodity  without  delay  until  either  the  demand  is  totally  satisfied  or  the  inventory  is 
completely  exhausted,  whichever  comes  first.  If  consumers  subsequently  depart  or  increase 
their  orders  without  being  served,  the  manager  refunds  or  pockets,  respectively,  the  then 
current  sale  price.  The  sale  is  final  only  when  consumers  receive  the  commodity  they  have  pur¬ 
chased.  We  assume  that  EU  and  Es(y  —  gib)  -  U)  are  finite,  where  E  denotes  the  expecta¬ 
tion,  and  that  the  demand  function  g(b)  is  increasing  and  concave  in  the  goodwill.  This  last 
assumption  has  been  verified  empirically  by  Shryer  [12]  and  Stone  [15]  for  mail  ordering, 
Telser  [16]  for  cigarettes,  Palda  [10]  for  drugs,  Clement  et  al.  [3]  for  milk,  and  Simon  [13,14] 
for  liquor. 

Given  a  finite  horizon  N,  let  C"ix,a)  be  the  minimum  expected  discounted  cost  in  the 
periods  n,  ,  N,  where  x  and  a  are,  respectively,  the  initial  inventory  and  goodwill  in  period 
n.  The  function  C"  can  be  calculated  for  each  period  by  the  dynamic  programming  recursion 

C"ix,a)  =  min  [c(y  —  x)  +  p(b  -  a)  4-  liiy)  +  £s[y  -  gib)  -  U] 

y^x.b^a 


(1)  -  /■[*(*)  +  EU)  +  (1  -  r,_)rE[y  -  gib)  -  U)~ 

+  XE  C'+'bi<y  -  g(b)  -  U),  Ob] I 

for  n  =  1 . N,  where  =  -cx  -  pa.  Every  symbol  in  Eq.  (1)  should  be  indexed 

by  «;  however,  when  no  confusion  is  possible  because  of  the  context,  the  index  n  is  suppressed 
throughout  this  paper.  For  simplicity,  we  will  assume  that  the  total  demand  is  uncorrelated 
from  one  period  to  the  next.  However,  the  results  to  be  proven  in  this  paper  can  easily  be  gen¬ 
eralized  if  the  basic  demands  in  a  given  period  are  correlated  with  the  basic  demands  in  a 
preceding  period,  independently  of  the  goodwill. 

It  is  convenient  first  to  transform  the  problem  to  one  with  no  ordering  cost,  a  technique 
which  has  been  used  by  Veinott  [20],  On  letting  C"ix,a)  =  C"(x,a)  +  cx  +  pa,  the  recursion 

(1)  becomes 

(2)  C"ix.a)  =  min  [Liy,b)  +  \EC"^[T\iy  -  gib)  -  U).  9b]), 

v  ^  x,b  ^  (j 

where 

(3)  Liy.b)  =  cy  +  hiy)  +  ip  -  \9p')b  +  Es  [y  -  gib)  -  U)  -  /•[£(/>)  +  EU] 

+  \r  -  -t)-ir  -  \c')]  fly  -  gib)  -  U]~  -  Ac'tj^ELv  -  gib)  -  UY . 
The  term  in  braces  in  Eq.  (2)  is  denoted  B"iy,b).  Under  this  transformation,  Cw,(.v.o)  =  0. 

Upon  regrouping  the  terms  in  Eq.  (3),  we  obtain 


- 
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(4) 

Liy.b)  -  Hiy)  +  P[g(6)]  +  Sly  -  gib)). 

where 

(5) 

Hiz)  -  cz  +  h(z), 

(6) 

Piz)  -  ip  -  \.9p')g~'iz )  -  rz, 

and 

(7) 

Siz)  -  £(s(z  -  U)  - \c'viz  -  U)  +  (1 

A  constant  term  —  rEU  has  been  omitted  as  it  clearly  does  not  affect  the  choice  of  the  optimal 
policy.  The  term  B"(y,b)  in  Eq.  (2)  is  not  necessarily  convex  since  it  is  the  sum  of  the  convex 
functions  with  the  concave  function  —  tj.  Hence  we  will  transform  Eq.  (1)  by  replacing  gib), 
gia ),  C"ix,a).  B"(y,b),  and  L„iy,b)  with  0,  a,  Cnix,a),  B"iy,0),  and  L„iy,0),  respectively. 
For  simplicity,  the  same  notation  is  used  for  C,  L ,  and  B  whether  the  goodwill  effect  (a  and  0) 
or  the  advertising  level  (a  and  b)  is  used.  The  resulting  recursion  is 

(8)  Cix.a)  -  min  [Hiy)  +  P(J3)  +  Siy  -  0)  + 

y'Zx.g^a 

A£Cn+1h(y  -  0  -  U),  g(9g-l(l3))]} . 

Because  gib )  is  increasing  in  b  and  gib)  =  0,  the  optimal  policy  0 ix.a)  associated  with  recur¬ 
sion  (8)  can  be  expressed  in  terms  of  the  optimal  policy  bix.a). 

In  the  new  problem  described  by  recursion  (8),  the  variable  0  can  be  thought  of  as  the 
additional  guaranteed  demand  that  the  manager  purchases  at  a  price  Pip).  This  situation  can 
arise  when  the  manager  discounts  the  merchandise  to  attract  additional  customers  or  to  increase 
the  quantity  purchased  by  customers.  In  the  remainder  of  this  paper,  the  term  goodwill  will 
instead  be  called  advertising  or  promotion.  The  term  advertising  is  used  when  the  presence  of 
goodwill  in  one  period  influences  the  level  of  subsequent  demands,  i.e.,  9  >  0,  and  the  term 
promotion  is  used  otherwise,  i.e.,  9  —  0.  Thus  the  effect  of  advertising  is  persistent  while  that 
of  promotion  is  ephemeral. 

From  here  on,  iy^ix.a),  0"ix,a ))  is  the  optimal  policy  in  period  n  whose  components  are 
the  optimal  inventory  and  the  optimal  controlled  demand  and  which  minimizes  the  right-hand 
side  of  Eq.  (8).  The  solution  to  the  minimization  of  the  total  costs  as  given  by  Eq.  (8)  when 
only  one  of  the  two  variables  can  be  chosen  and  when  that  variable  is, unconstrained  is  denoted 
v"ia)  and  0nix),  respectively.  Also,  the  solution  to  the  minimization  of  the  total  costs  in 
period  n  as  described  by  Eq.  (8)  when  the  two  variables  are  unconstrained  is  O'*",  0*").  This 
defines  the  base  stock  level  in  period  n  (as  we  shall  see  in  Section  3)  whose  components  are  the 
base  inventory  level  and  the  base  controlled  demand.  The  differences  y"ix,a)  -  0"ix,a)  and 
y*"  -  0  are  the  optimal  net  inventory  and  the  base  net  inventory  in  period  n,  rewritten 
z"ix,a)  and  z*",  respectively.  Finally,  the  solution  to  the  minimization  of  the  total  present 
costs  in  period  n,  as  given  by  the  right-hand  side  of  Eq.  (8)  with  the  last  term  omitted,  is  the 
myopic  policy.  In  the  preceding  sentences,  if  we  replace  superscript  n  by  subscript  n,  we  have 
the  myopic  counterparts  of  the  optimal  solutions.  For  example,  z'„  is  the  myopic  base  net 
inventory  in  period  n. 

3.  EXISTENCE  OF  OPTIMAL  SOLUTIONS 

Before  proving  the  existence  of  an  optimal  solution,  we  will  further  specify  the  admissible 
,  functions  g.  When  9  >  0,  we  also  assume  that  gip)  =  gi9g~'ip))  is  convex  in  0.  (The 
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reason  for  this  will  become  clear  shortly.)  This  means  that  each  additional  unit  of  demand  pur¬ 
chased  in  a  period  generates  more  demand  in  the  following  period  than  the  preceding  units  of 
demand.  Also,  since  g  is  increasing,  9  is  positive  and  g~'(fi)  is  positive  when  /3  is  positive,  it 
follows  that  g (J3)  <  /?  for  9  <  1,  g(j3)  -  /?  for  0  -  1,  and  g (J3)  >  p  for  9  >  1. 

Let  T  be  the  class  of  all  concave  increasing  functions  g  such  that  g(0)  —  0  and  g(fi)  is 
convex  in  /?.  The  class  T'  of  all  functions  kbv ,  where  k  >  0  and  1  >  v  ^  0,  is  a  subclass  of  T. 
Moreover,  gQ 3)  is  linear  in  f)  for  all  elements  of  T'.  If  9  <  1  and  k  >  0,  Zc  In  (b  +  1)  belongs 
to  r. 


To  ensure  that  the  minimand  in  Eq.  (8)  is  convex,  we  assume  that  the  following  condi¬ 
tions  hold  hereafter: 

CONDITION  1:  D+s( 0)  +  r  -  A c'y+  >  yAr  -  Ac')  +  D~s( 0) , 

CONDITION  2:  p  >  A 9p\ 

where  D+s(0)  (Z)~s(0))  indicates  the  right-(left-)  hand  derivative  of  s  at  zero.  The  left-hand 
side  of  Condition  1  can  be  regarded  as  the  marginal  revenue  generated  by  a  guaranteed  demand 
of  size  «  for  a  given  inventory  level  t,  where  e  is  an  arbitrarily  small  positive  number.  The 
marginal  revenue  consists  of  the  marginal  sales  price  minus  the  marginal  book  value  of  the 
inventory  sold  plus  the  marginal  savings  on  holding  cost  when  reducing  the  inventory  level. 
The  right-hand  side  term  can  be  thought  of  as  the  marginal  revenue  generated  by  an  arbitrarily 
small  guaranteed  demand,  given  a  nonpositive  inventory  level.  The  marginal  revenue  consists 
of  the  marginal  sales  price  on  preserved  sales  minus  the  marginal  book  value  of  the  inventory 
sold  minus  the  marginal  increase  in  shortage  cost  when  reducing  the  inventory  level.  Therefore 
Condition  1  means  that  the  marginal  revenue  at  an  arbitrarily  small  positive  inventory  level, 
generated  by  an  equally  small  guaranteed  demand,  must  be  no  smaller  than  the  marginal  reve¬ 
nue  at  any  nonpositive  inventory  level,  generated  by  an  arbitrarily  small  guaranteed  demand. 
Condition  2  asserts  that  the  present  price  p  of  a  unit  of  goodwill  is  greater  than  its  book  value. 


Thus,  both  S  and  P  are  convex,  since  g  is  concave  and  increasing  and  therefore  g_l  is 
convex  and  increasing.  By  the  assumptions  of  Section  2,  H  is  convex  and  P  is  continuous  at 
zero.  From  this  point  on,  we  will  deal  with  a  problem  with  convex  costs  //,  P ,  and  5  such  that 
P  is  continuous  at  zero.  We  now  show  that 

PROPOSITION  1:  Under  Conditions  (1,2),  C"(x,a)  is  convex  and  nondecreasing  in 
(x,  a). 

PROOF:  Of  course  C/v+l  =  0  is  convex.  By  the  induction  hypothesis,  Cn+1(x,a)  is  con¬ 
vex  in  (x,a).  Hence,  because  nondecreasing  convex  functions  of  convex  functions  are  convex, 
tj(y  -  p  -  U)  and  ECn+x[y(y  -  (3  -  U),  g(/3)l  are  convex  in  0\/3).  Thus  B"(y,p)  is  convex 
in  O',/?).  The  minimum  of  a  convex  function  over  a  convex  set  being  convex,  Cn(x,a)  is  con¬ 
vex  in  (x,o).  Because  the  sets  over  which  the  minimization  takes  place  are  decreasing  in 
(x, a ) ,  the  function  C"(x,a)  is  nondecreasing  in  (x,a).  This  concludes  the  proof. 

Since  Bn{y,B)  is  convex  and  convex  functions  are  continuous,  Bn(y,p)  is  continuous. 
Since  BH(y,fJ)  increases  to  infinity  with  |y|  +  /?  under  conditions  to  be  described,  the  space 
over  which  the  minimization  takes  place  can  be  limited  to  a  compact  set.  Also,  the  intersection 
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of  a  closed  set  {>>  >  x,  p  >  a)  of  R2  with  a  compact  set  is  compact.  Since  the  minimizing  set 
of  a  continuous  function  over  a  compact  set  is  compact,  the  minimizing  set  Mix, a)  of  B"(y,i 8) 
over  {>>  >  x,  p  >  a}  has  a  lexicographically  least  element  (jKx,a),/3(x,a)),  called  the  optimal 
policy. 

We  proceed  to  show  that  Bn(y,p)  tends  to  infinity  with  |j>  |  +  /3  under  the  following  addi¬ 
tional  conditions,  which  are  assumed  to  hold  hereafter  in  this  paper: 

CONDITION  3:  D~H(°°)  +  D~S( <*>)  >  0, 

CONDITION  4:  D~H( «>)  +  D~P(° °)  >  0, 

CONDITION  5:  D~P(°°)-  D+S(~ ~)  >  0, 

CONDITION  6:  D+H(-°°)  +  D+S(-°°)  <  0. 

The  interpretation  of  Conditions  (3-6)  is  very  natural.  Condition  (3)  states  that  with  fixed 
added  demand,  it  is  not  profitable  to  increase  the  starting  inventory  without  limit.  Condition 
(4)  states  that  it  is  not  profitable  to  increase  together  indefinitely  the  added  demand  and  the 
inventory.  Condition  (5)  states  that  when  no  inventory  is  ordered,  it  does  not  pay  to  increase 
the  added  demand  indefinitely.  Finally,  Condition  (6)  states  that  with  fixed  added  demand,  it 
does  not  pay  to  let  the  inventory  reach  a  very  small  level. 

PROPOSITION  2:  Under  Conditions  (1,2),  the  function  L„(y,p)  tends  to  infinity  with  M 
'+  /3  (fi  >  0)  if  and  only  if  Conditions  (3-6)  hold. 

PROOF:  The  only  if  part  follows  by  observing  that  since  £„(•,•)  is  convex.  Conditions 
(3-6)  are  respectively  equivalent  to  the  assertion  that  Ln(y,p)  —  °°  along  the  respective  half 
lines  y  ^  0,  p  =  0;  y  =  p  >  0;  y  =  0, P  >  0;  and  y  <  0,  p  =0. 

For  the  converse,  recall  first  that  the  convex  function  L„(y,p)  — *  °°  as  [y|  +  p  —>  °°  if 
that  is  so  along  every  half  line  emanating  from  the  origin.  Let  p  =  6y,  so  Ln(y,p)  associated 
with  Eq.  (5)  becomes 

0  (y )  =  L„  (y,  8y )  -  H  (y )  +  P  (8y )  +  5  ( ( 1  -  8  )y ) . 

CASE  1:  If  y  >  0  and  1  >  6  ^  0,  then 
lim  D~<f>(y)  =  lim  {D'Hiy)  +  8 D~Pi8y)  +  (1-S)DS((1  -S)y)j 

y— *oo  y—oo 

-  lim  (SfZ)-//^)  +  D~P(8y)]  +  0  —h)[D~H O')  +  £>"5((1  —  8)y)JI 

y—oo 

=8tzr//(w)  +  zr/>(°°)]-Mi-5)[z>-//(°o)  +  zrs(°o)]  >  o 
since  Conditions  (3,4)  hold. 

CASE  2.  Ify  >0  and  8  >  1,  then 
lim  D~<f>(y)  —  lim  {D~H(y)  +  8D~P{Sy)  +  (1  -8)D+S((1  -8)>>)) 

y — »oo  y—oo 

-  lim  \[D-H{y)  +  D-PiSy))  +  (8  -  l)[ZrP(8.y)-Z>+S((l  -8)>)]} 

y—oo 

-  [£)-//(»)  +  D-P(oo)]  +  (8  -  l)[D-P(oo)  -  D+S(-oo)]  >  0 
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since  Conditions  (4,5)  hold. 

CASE  3:  Let  y  <  0  and  8  <  0,  then 

lim  D+<f>(y)=  lim  [D^H(y)+8D~P(8y)  +  (1  —  8)Z>+S((1  - 8)^)} 

y — •— oo  y— — oo 

=  lim  ([Z>+//(y)  +  .D+S((l-8).K)]  +  8[Z)-/>(8^)-£)+S((l-8)^)]) 

y  *— oo 

=  tD+//(-oo)  +  D+S(-»)]  +  s[D-P(oo)-Z)+S(-oo))  <  0 
since  Conditions  (5,6)  hold.  This  completes  the  proof  of  Proposition  2. 

PROPOSITION  3:  If  Ljiy.b)  tends  to  infinity  with  M  +  b  for  /'  =  n, . . . ,  N,  then  Bn(y,b) 
tends  to  infinity  with  |y  |  +  b. 

PROOF:  This  is  obviously  true  for  BN  since  BN  =>  L\.  By  the  induction  hypothesis,  Bn+l 
has  the  property.  Hence,  by  convexity  B"+l  is  bounded  below  on  R  x  R+.  Thus,  C"+1  is 
bounded  below.  From  this,  we  conclude  that  B"(y,b)  tends  to  infinity  with  [y|  +  b.  This 
proves  the  assertion. 

Proposition  3  completes  the  proof  of  the  existence  of  the  optimal  policy. 

In  the  rest  of  this  section,  we  will  establish  sufficient  conditions  to  warrant  the  existence 
of  nonnegative  base  inventory  levels.  When  the  base  inventory  levels  are  nonnegative,  the  res¬ 
triction  t)_  <  ■»}+  can  be  waived  since  the  optimal  policy  (y(x,a),  /3(x,a))  is  greater  (see 
Theorem  1  in  Section  4)  than  or  equal  to  (y*,  /3*)  >  (0,0)  for  all  (x,a),  and  since 
DrC^'M*’),  g(v)l  -  0  for  all  w  <  0  and  v  ^  0,  where  D~C  denotes  the  left-hand  deriva¬ 
tion  of  the  function  C  with  respect  to  its  /  th  argument.  Obviously,  Conditions  (1-6)  must  still 
hold. 


We  assert  that  if  the  myopic  base  inventory  level  for  each  period  is  nonnegative,  then  the 
base  inventory  level  is  also  nonnegative  in  each  period.  We  show  this  by  induction.  Since 
CN+]  is  zero,  the  myopic  base  inventory  level  in  period  A  is  equal  to  the  period  Abase  inven¬ 
tory  level.  By  induction,  the  result  holds  for  period  n  4-  1.  Therefore,  D\  Cn+'(x,a)  -  0  on 
/?_  x  R+,  Since  tj(x)  ^  0  for  x  <  0,  6  is  nonnegative,  and  the  demand  is  nonnegative, 
ED\  Cn+l[Tj(_y  —  B  —  U),  g(fi)]  -  0  for  all  (y,p)  €  /?_  x  R+.  This,  with  the  fact  that  the 
myopic  base  inventory  level  is  nonnegative  in  period  n,  implies  that  the  period  n  base  inventory 
level  is  nonnegative,  completing  the  proof  of  the  assertion. 

An  expedient  method  to  guarantee  nonnegative  base  inventory  levels  is  to  impose  the  res¬ 
triction  y  >  0.  Another  method  is  to  impose  an  additional  condition  on  the  cost  structure  such 
that  y(0)  ^  0,  which  is  sufficient  to  ensure  that  y *  ^  0,  since  0  <  y  (0)  <  yQ 3*)  —  y*  (see 
Theorem  1)  where  p*  ^  0.  Because  of  the  preceding  paragraph,  it  will  suffice  to  establish  the 
result  for  myopic  base  inventory  levels.  By  differentiating  Eq.  (4)  with  respect  to  y  with 
b  -  B  —  0,  we  obtain  the  desired  condition 

CONDITION  7:  D~H(Q)  +  ZrS(0)  <  0, 

which  ensures  nonnegative  base  solutions.  For  inventory  problems  with  perishable  goods,  i.e., 
7j+  -  0,  the  nonnegative  base  inventory  guarantees  that  the  optimal  policy  is  independent  of  the 
level  of  backlogged  demand,  i\-x~. 
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4.  CHARACTERIZATION  OF  THE  OPTIMAL  POLICY 

In  this  section,  we  will  first  establish  that  C"(x,a )  is  subadditive  in  (x.a),  then  character¬ 
ize  the  minimizing  set  Mn(x,a)  and  the  form  of  the  optimal  solution.  We  remind  the  reader 
that  the  starting  stock  is  ( x,a ). 

PROPOSITION  4:  If  Conditions  (1-6)  hold,  C"(x,a)  is  subadditive  in  (x.a)  and  B"(y,p) 
is  subadditive  in  (y,/3). 

PROOF:  Clearly,  CN+ 1  =  0  is  subadditive  in  (x.a).  By  the  induction  hypothesis, 
Cv,+I(x,a)  is  subadditive  in  (x.a).  Therefore,  since  tj  is  convex  nondecreasing  and  gis  nonde¬ 
creasing,  EC"+,lr) (y  —  ft  —  U).  |(/ 3)1  is  subadditive  in  (y.B )  by  Example  A7.  (This  and  the 
following  examples  can  be  found  in  the  Appendix).  Because  S  is  convex,  S(y  —  1 3)  is  subaddi¬ 
tive  in  (y,/3)  by  Example  A6.  Thus,  by  Example  A8,  Bn(y,B )  is  subadditive  in  (y.P).  Since 
Bn(y,B)  is  independent  of  (x.a),  Bn(y,p)  is  subadditive  in  (x.a.y.) 3).  Also,  {y  >  x,  /3  ^  a) 
is  a  sublattice  of  R 4  by  Example  Al.  Since  the  minimum  of  a  subadditive  function  over  a  sub¬ 
lattice  is  subadditive  by  Theorem  Al,  (also  in  the  Appendix)  Cn(x,a)  is  subadditive  in  (x.a), 
completing  the  proof  of  the  proposition. 

We  now  turn  our  attention  to  the  set  Mn(x,a). 

PROPOSITION  5:  The  minimizing  set  M"(x,a)  is  a  nonempty  compact  sublattice. 

PROOF:  By  Proposition  1,  we  know  that  Bn(y,B)  is  continuous.  By  Proposition  4,  we 
also  know  that  B"(y,p)  is  subadditive  in  (y.ft).  Also  Bn(y,ft)  tends  to  infinity  with  |y|  +  (3. 
Therefore,  by  Theorem  A2  (see  Appendix)  the  minimizing  set  M"(x,a)  is  a  nonempty  compact 
sublattice.  This  completes  the  proof  of  the  assertion. 

The  lexicographically  least  element  of  Mn(x,a)  is  then  the  least  element  of  Mn(x,a)  and 
is  called  the  optimal  policy,  written  (y(x,a),  p(x,a)).  Also,  the  least  element  of  Mn(-°°,0 )  is 
denoted  O'*,/ 3*)  and  called  the  base  stock  level.  Before  further  characterizing  the  optimal  pol¬ 
icy,  we  recall  that  y(a)  is  the  least  y  minimizing  B"(y,(i)  subject  to  B  —  a  and  /3(x)  is  the  least 
B  minimizing  Bn(y,p)  subject  to  y  =•  xand  B  >  0. 

THEOREM  1:  Under  Conditions  (1-6),  y(x.a)  =  x  Vy(/3*V  a)  and  B (x.a)  =  a  V 
B  (y*  V  x)  with  y  and  B  being  nondecreasing  on  their  respective  domains. 

PROOF:  Proposition  5  and  Theorem  A2  imply  that  the  optimal  policy  exists  and  is  non¬ 
decreasing  in  (x, a)  and_that  /3  and  y  are  nondecreasing.  Also  (y(/3‘),  /3(y*)_)=(y*,0*).  Now 
for  (x.a)  ^  (y*,/3*),  (y(x,a),  /3(x,a))  -  (y_*,/3*).  For  x  >  y*  and  a  <  /3(x),  we  have  by 
the  convexity  of  B"  that  y(x,a)  =  x  and  so  j3(x,a)  -  B(x).  Similarly,  for  a  >  B*  and  x  ^ 
y(a).  y(x.a)  -  y  (a)  and  /3(x,a)  =  a.  Combining  these  two  facts  we  see  that  if  (y*,B *)  ^ 
(x.a)  ^  (P (a ) ,  0(x)),  then  (x.a)  -_(y(x,a),  /3(x,a)).  Now  if  (y*,B *)  <  (x.a)  and  if 
(p (a),  (i(x))  <  (x.a),  then  (y(x,a),  Rix.a))  =  (x.a).  For  if_not,  there  are  three  possibili¬ 
ties,  viz.,  (i)  y(x,a)  >  xand  fi(x,a)  —  a,  (ii)  y(x,a)  =  xand  /3(x,a)  >  a,  or  (iii)  y(x,a)  > 
xand  B(x,a)  >  a.  If  (i)  occurs,  then  y(«)  “  y(x,a)  >  x  >  y(a)  which  is  impossible.  If  (ii) 
holds,  then  B(x)  —  B(x.a)  >  a  >  B(x)  which  is  also  a  contradiction.  Finally,  if  (iii)  holds, 
then  since  B"  is  convex,  Bn  is  nonincreasing  along  the  line  from  (y.B)  to  (y*,^*),  implying 
that  either  (i)  or  (ii)  holds,  contradicting  (iii).  This  completes  the  proof. 

To  sum  up,  we  have  shown  that  the  optimal  policy  is  as  depicted  in  Figure  1.  Also,  we 
have  proved  y  (fi  *)  —  y*  and  B  *  ”  4Cy*)- 
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For  this  paragraph,  we  further  assume  that  the  costs  and  the  demands  are  stationary  and 
that  the  depletion  factors  of  inventory,  17+,  and  of  goodwill,  0,  are  no  greater  than  one.  When 
the  initial  inventory  x  and  goodwill  a  are  smaller  than  the  base  stock  level  of  the  first  period, 
the  myopic  policy  is  optimal  in  every  period  [20].  It  is  optimal  to  order  ( y *  -  x,/3  *  -  a)  at  the 
beginning  of  the  period.  By  further  restricting  A  to  be  smaller  than  one,  the  model  easily  car¬ 
ries  over  to  the  infinite  horizon.  We  could  obtain  myopic  optimal  policy  for  this  problem  under 
a  nonstationary  structure  by  applying  the  conditions  defined  in  Veinott  [20]  and  in  Ignall  and 
Veinott  [5]. 


5.  MONOTONICITY  OF  THE  OPTIMAL  POLICY 

In  this  section,  we  shall  take  advantage  of  the  particular  structure  of  the  optimal  policy  to 
exhibit  its  monotonicity  in  the  initial  parameters.  Namely,  we  will  prove: 

THEOREM  2:  For  all  n,  y"(x,a)  and  /3"(x,a)  are  nondecreasing  in  (x,a).  Also  z"(x,a), 
x  -  fix,  a),  x  -  /3"(x,a),  and  x  -  z"(x,a )  are  nondecreasing  in  x.  Moreover,  a  -  p"  (x,o) 
and  a  +  z"(x,a )  are  nondecreasing  in  a.  Finally,  if  H  is  linear,  a  -  J'Tx.a)  is  unimodal  in  a, 
and  z"(x,a)  is  nondecreasing  in  a. 

PROOF:  By  Theorem  1,  y"(x,a)  and  p"(x,a)  are  nondecreasing  in  (x,a).  Let  y  —  xand 
P'  -  x  -  p,  and  consider 

min  (P(x  -  p')  +  Sip')  +  \ECn+'{y{p'  -  U),  gix  -  /3')]) . 

x-0’>O 

Since  Pis  convex,  P(x  -  P')  is  subadditive  in  ix,p')  by  Example  A6,  and  since  C"+1  is  subad¬ 
ditive  on  R  x  R+t  tj  and  £are  nondecreasing.  \EC"*l[j)ip'  -  U),g{x  —  j3')I  is  subadditive  in 
ix,p')  by  Example  A7.  By  Theorem  A2,  the  greatest  element  /S'  —  x  —  P"ix),  minimizing  the 
term  inside  the  braces_subject  to  P'  <  x,  js  nondecreasing  in  x.  Thus,  x  -  yn(x,a)  -  x  -  xV 
y*ip **  Va)  and  x  -^"(x,«)  —  x  -  aVP "  Cy*"Vx)  are  nondecreasing  in  x.  Also,  7"(x,a)  — 
xV J"(/S*"Va)  -  aV  P"iy',*V x)  is  nondecreasing  in  x.  Moreover,  x  -  znix,a)  —  x  -  yn(x,a) 
+  P"(x,a)  is  nondecreasing  in  x. 
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Now,  a  -  pn(x,a)  —  a  -  a  V  j3"(y*n\/  x)  is  nondecreasing  in  a.  Also,  a  +  zn(x,a)  —  a 
-  /3n(x,a)  4-  y"(x,a)  is  nondecreasing  in  a.  Let  /3  —  a  and  —y'  —  a  —  y,  and  consider 

min  {H(a  +  y')  +  50/')  +  A£C"+lb(/  -  t/),J(«)]} . 

y' 

If  H  is  linear,  the  term  inside  the  braces  is  subadditive  in  ( a,y ')  by  Examples  A5  and  A8  and 
the  fact  that  C"'+I(x,a)  is  subadditive  in  (x,a).  Therefore,  the  least  y'  —  ^"(a)  -  a  achieving 
the  minimum  is  nondecreasing  in  a.  Thus,  a  -  yn(x,a)  -  a  -  xV  v"(J3'"Wa)  is  unimodal  in 
a.  Since  z"(x,a)  =  x  \/yn(fi*"\/a)  -  avj3"(y*"\Jx),  zn(x, a)  is  nondecreasing  in  a,  complet¬ 
ing  the  proof. 

When  the  goods  are  assumed  to  be  totally  perishable,  i.e.,  t>+  =  0,  we  can  transform  the 
problem  into  a  convex  program  and  exhibit  the  monotonicity  of  the  optimal  policy  in  the  initial 
guaranteed  demand  a.  By  imposing  Condition  (7)  or  y  ^  0,  we  ensure  that  the  base  inventory 
level  is  nonnegative.  Since  the  goods  are  perishable,  x  <  0  always.  Also,  since  y*"  is  always 
nonnegative,  C"(x,a)  =  C'TO.a)  for  all  x  <  0.  Thus,  with  Cn(a)  =  C"( 0,a)  and  tj+  =  0, 
recursion  (8)  becomes 

(9)  C"(a)=  min  {//(>-)  +  P(f3)  +  S(y  -  /J)  +  kCn+'[g(P))} . 

y>  O.0>c 

Since  the  state  variable  in  Eq.  (9)  is  not  random,  it  follows  that  the  advertising  of  perish¬ 
able  goods  model  is  a  deterministic  convex  program,  viz., 

<10)  C’(«)  -  min  £  lH„(y„)  +  PM  +  S„(y„  -  f3n)] 

n—  \ 

such  that  y„  >  0  for  all  n,  /3„  >  for  all  n  >  1,  and P\  ^  a.  Thus  it  is  not  necessary  to 

find  the  optimal  policy  for  all  values  of  the  initial  controlled  demand  in  every  period.  Since  the 
right-hand  side  of  Eq.  (10)  is  subadditive  in  yn  and  /3„,  and  the  set  of  feasible  solutions  is  a 
nonempty  closed  sublattice  of  _R2N  that  is  bounded  below,  there  is  a  least  vector 
(y i1  (a),  ...  ,  y^ia),  /3l(a ),  ...  ,  /3 \(a))  minimizing  the  right-side  of  Eq.  (10)  over  the  feasi¬ 
ble  region.  These  vectors  can  be  found  by  nonlinear  programming  algorithms.  Let  z„'(a)  = 
y„'(a)-f}}(a). 

JTHEOREM  3:  When  r)+  =  0,  for  each  n,  the  functions y„l (a),  /3,J(a),  -z„'(a),  a  -  Jn(a), 
a  —  /3n(a),  and  a  +  z"(a)  are  nondecreasing  in  a. 

PROOF:  By  Theorem  A2,  (J„' (a)^j(a))  is  nondecreasing  in  a  ^  0  for  all  n.  Replacing 
y„  -  fJ„  by  —z„\  the  right-hand  side  of  Eq.  (10)  becomes 

(11)  min  Z  -  z'n)  +  Pn(fi„)  +  S„(-Z„')] 

n— ! 

subject  to  P„  >  z'„  and  ^  g(j 3„_|).  Since  H„  is  convex,  by_the  same  argument  as  above, 

the  (AM-1,.  . .  ,  2Af}-lexicographically  least  element  (/3|'(a) . /3^(a),  -zj1  (a) . -Zjv(a)) 

minimizing  Eq.  (11)  subject  to  the  constraints  indicated  above  is  nondecreasing  in  a,  so  z„'(a) 
is  nonincreasing  in  a  for  all  n. 

If  instead  we  let  /8'  —  a  —  j8  and  y'  “  a  —  y,  Eq.  (9)  becomes 


(12) 


C”(a)  ”  min  [H{a  —  y')  +  P(a  —  /3')  +  S(fi'  —  y')  + 

.F'<a.0'<O 

AC"  +  l[g(a  -  /3’)]|  . 
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Since  P ,  H,  S,  and  C"+1(f  (  ))  are  convex,  the  term  inside  the  braces  in  Eq.  (12)  is  subadditive 
in  (a.y'.ft')  by  Example  A6.  The  space  over  which  the  minimization  takes  place  is  a  sublattice 
by  Example  Al.  By  Theorem  A2,  the  greatest  element  (a  -  yn(a),  a  -ftn(a)),  minimizing 
the  right  hand  side  of  Eq.  (12),  is  nondecreasing  in  a.  Moreover,  a  +  F"(a)  - 
a  -  ft  "(a)  +  yn(a)  is  also  nondecreasing  in  a  by  the  above,  completing  the  proof. 

The  results  of  Theorem  3  are  stronger  than  those  of  Theorem  2,  as  not  only  the  current 
optimal  policy  is  increasing  in  the  initial  parameter  but  also  all  subsequent  optimal  policies.  As 
the  guaranteed  demand  due  to  past  advertisement  increases,  the  additional  impact  of  optimal 
current  expenditures  on  advertisement  must  not  increase.  Similarly,  the  optimal  inventory 
level  must  not  increase  as  fast  as  guaranteed  demand  due  to  past  advertisement. 

6.  GENERALIZED  DISPOSAL 

In  the  remaining  sections,  9  is  assumed  to  be  0.  The  assumption  0  =  0  means  that  the 
effect  of  advertising  is  ephemeral  and  therefore  the  initial  goodwill  in  each  period  is  a  =  0. 
The  problem  as  described  by  Eq.  (8)  is  thus  reduced  to 

(13)  Cn(x)=  min-  {H(y)  +  P(ft)  +  S(y  -  ft)  +  kECn+'[v(y  -  ft  -  U)]\ , 

yZx.p^O 

where  Cn+I  =  0.  Heretofore,  we  have  supposed  the  cost  p(b)  of  purchasing  b  units  of  promo¬ 
tion  in  a  period  is  linear  in  b,  and  that  g  and  g  are  concave  and  convex,  respectively,  and  non¬ 
negative  and  increasing.  From  now  on,  we  drop  these  assumptions  and  assume  only  that  g  is 
nonnegative  and  increasing,  and  that  p(g~l(fi))  is  convex  in  ft  ^  0.  Also,  let  P(ft)  = 
p(g~'(t 3))  -  rft.  Thus,  each  additional  unit  of  added  demand  in  a  period  is  at  least  as  expen¬ 
sive  as  its  predecessor.  Under  Conditions  (3-6)  with  D~p(g~l(°°))  replacing 
(p  -  \.0p')D~g~l(°°),  an  optimal  solution  will  exist  when  the  problem  is  expressed  in  the  origi¬ 
nal  costs. 

We  will  show  that  under  the  linearity  of  P  and  H,  the  problem  at  hand  is  equivalent  to  the 
usual  inventory  model  with  disposal  as  discussed  by  Fukuda  [4).  If  Pis  linear  the  result  of 
Theorem  1  can  be  sharpened  in  the  following  way.  Let  S'(z)  —  S(z)  -  Pz  and  z  —  y  -  ft  then 
Eq.  (13)  becomes 

(14)  C"(x)  -  min  {//(x)  +  Px  +  S'(z)  +  A.£C',+,[t?(z  -  (/)]}, 

X^Z 

so  ftn(x)  —  (x  —  z**")+,  where  z**n  is  the  greatest  z  minimizing  the  sum  of  the  last  two  terms 
in  braces  in  Eq.  (14)  over  R.  This  result  is  summarized  below. 

LEMMA  1:  If  P  is  linear  on  R+,  then  ft"(x)  —  (x  -  z**")+,  ftn(x)  —  (x  -  z**")+ 
VO'*"  -  z**")  —  (x  v y*n  —  z**")+,  and  z"(x)  =  z**"  A(x  Vy*"). 

REMARK:  If  y*"  ^  z**n,  then  z"(x)  —  z**"  =  z*"  for  all  x  and  if  y*"  <  z**n,  then 

y*n  _  z*n  for  x  ^  y*n 

(15)  z"(x)  -  x  for  y,n  <  x  <  z**n . 

z**"  for  z**"  <  x 

An  important  question  here  is:  when  is  y *"  <  z**"?  We  already  know  that  y*n  <  z*,n  implies 
ft  *"  —  0.  Therefore  y*a  is  the  least  y  minimizing 


(16) 


\H(y)  +  SO-)  +  A£C"+i[ijO  -  tOl! . 
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By  comparing  Eqs.  (14)  and  (16),  we  obtain  that  D  H(y*n)  +  P  >  0  implies  y*n  <  z**\ 
Similarly,  a  necessary  condition  for  y*"  <  z**"  is  D+H(y*n)  +  P  >  0. 

In  the  rest  of  this  subsection  we  assume  also  that  H  is  linear,  so  Eq.  (13)  becomes 
Cn(x)-  min  [Hy  +  P0  +  S(y  -  /3)  +  X£C',+1hO'  -  /3  -(/)]) . 

y>x.0>  0 

On  letting  z  “  y  -  ) 3,  we  have 

C"(x)  -  min  {(//  +  P)y  -  Pz  +  S(z)  +  X£Cn+,h(z  -  £/))) . 

y>:Vi 

Since  H  +  P  >  0  by  Condition  (4),  y  -  z  V  x  in  the  above  recursion  which  then  reduces  to 
C"(x)  -  min  {(//  +  £)(z  V  x)  -  Pz  +  S(z)  +  XECn+'[r,(z  -  1/)]) 

-  min  {Hx  +  H{z  -  x)+  +  P(z  -  x)~  +  S(z)  +  X£C"+lMz  -  1/)])  - 

Z 

Let  C"(x)  =*  C"(x)  —  Hx  and  S(z)  —  S(z)  +  \H'Ei}(z  —  £/),  so  the  new  recursion  is 
C"(x)  -  min  [H(z  -  x)+  +  P(z  -  x)~  +  S(z)  +  X£C',+i[t}(z  -  £/)]} . 

Z 

This  is  the  usual  formulation  of  the  recursion  for  the  inventory  problem  with  disposal,  where 
y*n  is  the  base  ordering  level  and  z**"  the  base  disposal  level. 

7.  CONVEX  PRODUCT  ORDERING  COST 

In  this  section,  we  generalize  our  model  from  linear  to  convex  product  ordering  cost. 
Suppose  now  that  there  is  a  nondecreasing  ordering  cost  function  c,  where  c(z)  =  0  for  z  <  0. 
If  we  define  c  to  be  equal  to  £>+c(0),  then  the  right-hand  derivative  of  the  function  c(z)  = 
c(z)  -  cz  is  equal  to  zero  at  zero.  Under  this  transformation  Eq.  (13)  becomes 

(17)  C"(x)  =  min  {c Cv  -  *)  +  H(y)  +  £03)  +  S(j  -  P)  + 

y>x.0»o 

KECn+llv(y  —  P  -  tfll). 

where  H,  P,  and  S' are  defined  by  Eqs.  (5),  (6),  and  (7),  and  C"+i(a r)  =  0.  Under  Conditions 
(3-6),  L„(y,  /3)  tends  to  infinity  with  |y|  +  /3  by  Proposition  2.  Since  c(y  -  x)  is  nonnegative 
for  all  Ln(y,l3,x)  =  Ln(y,(3)  +  c(y  -  x)  also  tends  to  infinity  with  \y\  +  for  every  x.  By 
Proposition  3,  B"(y,l 3,x)  does  similarly  upon  noticing  that  C”+1(x)  is  bounded  below.  There¬ 
fore,  an  optimal  solution  exists  for  every  x.  As  L„(y,/3,x)  is  a  convex  function  of  (. y,/3,x), 
C”(x)  is  convex  in  x,  provided  tj  is  linear. 

THEOREM  4:  If  tj  is  linear,  Ji"(x),  zn(x),  pn(,x),  x  -  y'Tx),  x  -  z"(x),  and  x  -  ^"(x) 
are  each  nondecreasing  in  x. 

PROOF:  Since  the  additional  term  c(/  —  x)  in  Eq.  (17)  relative  to  Eq.  (13)  is  subadditive 
in  Cy,x),  by  Proposition  5  Bn(x,y,(})  is  subadditive  in  (x,y,i 3),  and  by  Theorem  A2  y”(x)  and 
f}"(x)  are  nondecreasing  in  x.  Let  z  =  y  -  /3  in  Eq.  (17)  which  becomes 

(18)  C"(x)  =  min  {c(>  -  x)  + //O’)  +  £(y  -  z) -(- S(z)  +  X£C"+1[r)(z  -  t/)l)  • 

v  >  x.y  ^  r 

Since  P  and  c  are  convex,  the  term  inside  the  braces  is  subadditive  in  (x,v,z)  by  Examples  A6 
and  A8.  Therefore,  by  Theorem  A2  the  2-lexicographically  least  element,  ( Vv"(x ),  z"(x)) 
minimizing  the  right-hand  side  of  Eq.  (18)  is  nondecreasing  in  x. 
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Upon  letting  y'  =  x  —  y  and  0'  =  x  —  0,  Eq.  (17)  becomes 
(19)  Cn(x)  -  min  (c(-y')  +  H(x  -  y')  +  P(x  -  0')  +  S(0'  -  y')  + 

\EC+'[-nU3'  -  y'  -  (/)]). 

Since  H,  P ,  S  and  C'I+1  are  convex  and  rj  is  linear,  the  term  inside  the  braces  in  Eq.  (19)  is 
subadditive  in  (x,y',/3')  by  Examples  A6  and_A8.  Therefore,  by  Theorem  A2  the  {1,2}- 
lexicographically  least  element  (x  —  y"(x),x  -  0"(x))  minimizing  the  right-hand  side  of  Eq. 

(19)  is  nondecreasing  in  x.  Moreover,  x  —  z"(x)  =  (x  -  yn(x))  +  (3n  (x)  is  nondecreasing  in 
x  by  the  above,  completing  the  proof. 

8.  COMMON  TIME  LAG  IN  DELIVERY  AND  PROMOTION 

In  this  section,  we  generalize  our  models  by  allowing  a  common  interval  of  time  to  occur 
between  the  moments  of  product  ordering  and  of  promotion  purchase,  respectively,  and  the 
moments  of  the  delivery  of  the  product  and  of  the  promotion  effect.  This  interval  of  time,  v , 
where  v  is  a  nonnegative  real  integer,  is  called  a  time  lag. 

Let  x  be  the  inventory  on  hand  plus  on  order  minus  the  controlled  demands  of  periods  n 
to  n  +  v  —  1,  at  the  beginning  of  period  n.  Also,  0  is  the  controlled  demand  that  will  take 
effect  in  period  n  4-  v.  In  addition,  all  costs  are  incurred  at  the  beginning  of  period  n  +  v. 
Under  the  assumption  tj(x)  =  x,  recursion  (13)  becomes 

(20)  C"(x)  =  min  [H(y)  +  P(0)  4-  S(y  -  p)  +  \EC"+'(y  -  p  -  [/)} , 

0 

where  n  <  N -v,  H,  and  Pare  defined  by  Eqs.  (5)  and  (6),  and 
S(z)  =  Es(z  -  V)  -  Ac'z  , 

with  V  a  random  variable  whose  distribution  is  the  convolution  of  Fn . F„+„. 

Since  the  structure  of  Eq.  (20)  is  identical  to  that  of  Eq.  (13),  all  results  discussed  in  Sec¬ 
tions  2-6  apply  directly. 

9.  CONCLUSION 

This  paper  has  shown  the  existence  of  an  optimal  policy  to  an  inventory  problem  where 
the  demand  is  influenced  through  advertising  under  the  control  of  the  same  management  unit. 
The  present  approach  links  together  two  generally  separate  functions  of  firm  management, 
though  there  are  examples  of  such  a  link  in  the  real  world,  such  as  an  ad  campaign  for  an  end 
of  season  sale,  or  an  increase  in  inventory  by  merchants  prior  to  a  major  ad  blitz  by  a  producer 
or  by  themselves.  When  the  optimal  policy  for  inventory  level  and  advertising  level  is  charac¬ 
terized,  this  link  is  rendered  more  explicit  as  each  component  of  the  optimal  policy  is  increasing 
in  the  other  and  also  in  the  initial  levels  of  inventory  and  advertising. 

Numerically,  this  two-variable  dynamic  program  can  be  treated  like  two  one-variable 
dynamic  programs  since  the  optimal  policy  (x  v  f(/3  *  V«),  nV|i(xV  y*))  is  a  known  function 
of  two  single  variable  functions  y  and  0  over  the  range  (y*, «>)  and  (0*,°°),  respectively. 
Since,  in  general,  these  functions  cannot  be  obtained  in  closed  form,  they  are  evaluated  using 
one-variable  search  procedures.  Thus,  the  number  of  numerical  computations  necessary  to 
solve  this  problem  are  of  the  same  order  of  magnitude  as  that  of  a  one-variable  dynamic  pro¬ 
gram. 
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APPENDIX 

SUBLATTICES  AND  SUBADDITIVE  FUNCTIONS 

This  whole  appendix  summarizes  some  results  of  Topkis  [17]  and  Veinott  [23]  (forthcom¬ 
ing)  on  the  minimization  of  subadditive  functions  on  a  sublattice.  Throughout,  C  is  the  Carte¬ 
sian  product  Ci  x  ...  x  C„  of  a  finite  number  of  chains.  A  chain  is  a  completely  ordered  set. 
For  example,  the  real  line  R  is  a  chain. 

A  subset  I  of  a  partially  ordered  set  is  called  a  lattice  if  every  two  elements  s,  s'  6  L  have 
a  greatest  lower  bound  in  L ,  called  their  meet  and  denoted  s  A  s\  and  a  least  upper  bound  in  L, 
called  their  join  and  denoted  s  V  s'. 

A  subset  S  of  a  lattice  L  is  called  a  sublattice  of  L  if  S  is  a  lattice  and  if  the  meet  and  join 
of  every  two  elements  of  S  coincide  respectively  with  the  meet  and  join  of  those  same  two 
points  considered  as  elements  of  L. 

EXAMPLE  Al:  The  set  [s  €  R"\ausi  +  b^Sj  >  c,n  for  1  ^  i,  j  ^  n)  is  a  sublattice  of  Rn 
provided  that  <  0  for  all  1  <  /,  j  <  n ■ 

EXAMPLE  A2:  A  chain  in  a  lattice  is  a  sublattice. 

EXAMPLE  A3:  The  Cartesian  product  of  two  sublattices  is  a  sublattice. 

The  set  Ls  =  {s'  €  L'|(s,s')  €  5},  where  S'  is  a  sublattice  of  the  Cartesian  product  L  x  L' 
of  two  lattices,  is  called  a  section  of  S.  The  set  n^.S  =  UsiL  Ls  is  called  the  projection  of  S  on 

L'. 


EXAMPLE  A4:  The  section  Ls  and  the  projection  IJ^  S  are  sublattices  of  L. 

A  real  valued  function  /  on  a  lattice  L  is  called  subadditive  if  f  {x  Ay)  +  f  (x  V y)  ^ 
f  (x)  +  f(y)  for  all  x,  y  €  L,  and  superadditive  if  the  inequality  is  reversed.  The  assertion  that 
/  is  subadditive  on  the  product  Lx  L'  of  two  chains  is  equivalent  to  the  assertion  that  the 
mixed  second  differences  of  /  are  nonpositive  on  L  x  L'.  If  /  is  twice  continuously 
differentiable  and  Lx  L'  is  an  open  subset  of  R2,  then  /  is  subadditive  if  and  only  if  Dnf  ^  0 
on  L  x  L  '. 

LEMMA  Al:  A  function  / on  C  is  subadditive  if  and  only  if  /(s)  is  subadditive  in  (s,,s7) 
on  Cu  x  Cj  for  each  i  <  i  <  j  <  n  and  each  fixed  sk  6  Ck  with  k  r*  i,  j. 

For  the  next  three  examples,  we  define  the  following  functions: 

f:L  x  L'  —  R.  g\L  —  L,  ~g':L' -*  L,  h\L'  —  L',  and£:L  x  L *  L, 
where  L  and  L'are  subsets  of  R. 

EXAMPLE  AS:  If  f(s,t)  is  subadditive  on  L  x  L'and  g  and  h  are  nondecreasing  on  L  and 
L',  respectively,  then  fig(s),h  (t))  is  subadditive  in  (s,t)  on  L  x  L'. 


•MMH 


288 


Y.  BALCER 


EXAMPLE  A6:  If  L  +  L  —  L,  fis.t)  is  subadditive  on  L  x  L'  and  convex  in  s  on  L  for 
each  t  €  L\  and  if  g,  g'  and  h  are  nondecreasing  on  their  respective  domains,  then 
figis)  —  g'it),  hit))  is  subadditive  in  ( s,t )  on  Lx  L'. 

EXAMPLE  A7:  If  fis.t)  is  subadditive  on  L  x  L'  and  convex  nondecreasing  in  s  on  L  for 
each  t  €  L\  g  is  subadditive  on  L  x  L\  g  is  nondecreasing  on  L  and  nonincreasing  on  L\  and  h 
is  nondecreasing  on  L\  then  f(g(s.t),h  it))  is  subadditive  in  (s,t)  on  L  x  L'. 

EXAMPLE  A8:  The  set  of  subadditive  functions  on  a  lattice  is  a  convex  cone. 

LEMMA  A2:  If  /is  subadditive  on  C  where  C)  C  R,  F  is  a  chain  in  the  set  of  all  real 
valued  distributions,  and  git.F)  =  J  fis,t)dFis)  is  finite  for  all  it,F)  €  C2  x  •••  x  C,  x  f, 
then  g(t,F)  is  subadditive  in  ( t.F )  on  C2  x  •  •  ■  x  C„  x  F. 

THEOREM  Al:  (Projection  theorem).  If  5  is  a  nonempty  sublattice  of  the  product  L  x 
E' of  two  lattices,  /is  subadditive  on  S',  and  g(s)  =  infs  €i.s  f(s,s')  is  finite  for  s  €  II tS,  then  g 
is  subadditive  on  I1/-S. 

If  L\, . . .  ,  L„  are  partially  ordered  sets  and  L  C  L,  x  •  •  •  x  Ln,  we  say  s  €  L  is  lexico¬ 
graphically  smaller  than  t  €  L,  written  s  <  /,  if  either  s  =  /  or  s  /  /  and  j,  <  t,  for  /  = 
j,  where  j  is  the  smallest  index  /  for  which  s,  ^  If  A  is  a  subset  of  the  first  n  positive 
integers,  denote  by  LA  the  set  L  where  the  ordering  on  L,  is  reversed  for  each  i  €  A.  We  say  s 
€  L  is  /^-lexicographically  smaller  than  t  €  L,  written  s  </)r,  if  s  is  lexicographically  smaller 
than  t  in  LA. 

THEOREM  A2:  (Monotonicity  theorem).  If  S  is  a  nonempty  sublattice  of  the  product  L 
x  L'  of  two  lattices  where  L'  C  R",  if  Ls  is  compact  for  each  s  6  n^S,  and  if  /  is  subadditive 
on  S  and  continuous  on  Ls  for  each  s  G  n^S,  then  the  set  M(s)  of  points  minimizing  /(s,  •) 
over  Ls  is  a  nonempty  compact  sublattice  of  Ls  for  each  s  6  H/  S.  Also,  Mis)  has  least, 
greatest,  and  /1-lexicographicaliy  least  elements,  each  of  which  is  nondecreasing  in  s  on  II /  5. 
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ABSTRACT 

This  paper  deals  with  a  periodic  review  inventory  system  in  which  a  con¬ 
stant  proportion  of  stock  issued  to  meet  demand  each  period  feeds  back  into 
the  inventory  after  a  fixed  number  of  periods.  Various  applications  of  the 
model  are  discussed,  including  blood  bank  management  and  the  control  of 
reparable  item  inventories.  We  assume  that  on  hand  inventory  is  subject  to 
proportional  decay.  Demands  in  successive  periods  are  assumed  to  be  indepen¬ 
dent  identically  distributed  random  variables.  The  functional  equation  defining 
an  optimal  policy  is  formulated  and  a  myopic  base  stock  approximation  is 
developed.  This  myopic  policy  is  shown  to  be  optimal  for  the  case  where  the 
feedback  delay  is  equal  to  one  period.  Both  cost  and  ordering  decision  compar¬ 
isons  for  optimal  and  myopic  policies  are  carried  out  numerically  for  a  delay 
time  of  two  periods  over  a  wide  range  of  input  parameter  values. 


INTRODUCTION 

This  paper  deals  with  the  analysis  of  inventory  systems  in  which  recycling  occurs.  The 
term  recycling  is  used  here  to  indicate  that  a  fixed  fraction  of  the  stock  used  to  satisfy  demand 
returns  to  inventory  after  a  fixed  number  of  periods. 

Feedback  or  recycling  in  inventory  systems  can  occur  in  a  number  of  different  ways. 
Examples  include  systems  where  customers  buy  items  with  a  rent/purchase  option  and  return 
items  that  are  not  ultimately  purchased.  Another  cause  of  recycling  is  the  result  of  over¬ 
ordering  stock.  This  occurs  in  hospital  and  regional  blood  banks  since  physicians  requesting 
blood  for  their  patients  tend  to  over-order  by  a  factor  of  two  or  three.  A  further  application  of 
recycling  occurs  in  retail  sales  systems  where  a  fixed  fraction  of  stock  purchased  by  customers 
may  be  returned,  and  subsequently  mixed  with  existing  inventory. 


•Research  supported  by  Grant  ENG-77-07463  from  the  National  Science  Foundation 

••Research  supported  by  Grant  78-3494  from  the  Air  Force  Office  of  Scientific  Research  and  Grant  ENG  78-05928 
from  the  National  Science  Foundation.  Visiting,  Department  of  Operations  Research  Stanford  University,  1978-1979 


289 


290 


M.  A.  COHEN,  S.  NAHMIAS  AND  W.  P  PIERSKALLA 


The  phenomenon  of  recycling  can  also  be  observed  in  reparable  item  inventories.  An 
issue  stock  inventory  is  maintained  to  replace  items  in  the  field  which  are  subject  to  failure. 
Failed  items  are  returned  for  repair  and  after  a  fixed  delay  time  (which  includes  the  time 
required  for  transportation  and  repair),  the  repaired  item  is  returned  to  the  issue  stock  inven¬ 
tory.  A  fraction  of  those  items  that  fail  are  condemned  and  leave  the  system  forever.  The 
reparable  item  system  is  pictured  in  Figure  1. 


FRACTION  <1»)  OF  ITEMS 


Figure  1. 


Previous  analyses  of  this  class  of  inventory  system  have  been  restricted  primarily  to  simu¬ 
lation  studies  of  blood  banks  (Cohen  and  Pierskalla  [2]),  or  systems  where  demand  is  deter¬ 
ministic  and  the  proportion  of  stock  recycling  is  treated  as  a  random  variable  (Cohen,  Nahmias 
and  Pierskalla  [3]).  Related  reparable  item  inventory  models  include  Prawda  and  Wright  [4] 

and  Allen  and  D’Esopo  [1].  This  paper  treats  the  case  where  demand  (or  failure)  is  stochastic 

and  where  the  fraction  of  stock  that  feeds  back  into  the  system  is  fixed. 

The  paper  begins  with  a  description  of  notation  and  assumptions  for  a  general  model  with 
arbitrary  recycle  lag  and  stochastic  demand  and  the  functional  equation  satisfied  by  the  optimal 
order  policy  is  formulated.  A  myopic  approximation  to  the  optimal  order  policy  and  conditions 
for  its  optimality  are  derived  for  the  case  of  an  arbitrary  recycle  period.  The  optimality  of  this 
policy  for  the  case  where  the  recycle  period  is  equal  to  one  is  then  demonstrated. 

The  final  section  reports  on  the  results  of  a  numerical  analysis  comparing  optimal  and 

myopic  policies  for  a  variety  of  cases  where  the  recycle  lag  is  equal  to  two  periods.  These 

results  suggest  that  the  myopic  policy  provides  a  very  effective  approximation  to  the  optimal. 


i 
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MODEL  ASSUMPTIONS  AND  NOTATION 

A  periodic  review  inventory  system  with  the  following  features  is  considered: 

•  Successive  demands  {/>,}  are  independent  and  identically  distributed  random  variables 
with  known  cumulative  distribution  function  F(-)  and  density  /(•). 

•  A  fixed  fraction,  a,  of  stock  issued  to  meet  demand  is  returned  after  a  delay  of  k  > 
1  periods.  The  fraction,  1  -  a,  is  consumed. 

•  A  fixed  fraction,  /3,  of  stock  on  hand  at  the  end  of  each  review  period  survives, 
without  decay,  into  the  next  period  and  the  fraction,  1  —  /3,  is  lost  to  decay. 

•  Excess  demand  is  lost  in  each  period. 

•  There  is  no  leadtime  for  ordering.  That  is,  orders  are  received  in  the  period  in  which 
they  are  placed. 

•  Excess  demand  is  lost  (lost  sales). 

Time  periods  are  numbered  forward  by  integers  n,  n  =  1,  2,  ,  T,  where  T  is  the  decision 

horizon  for  the  problem. 

The  following  variables  describe  the  state  of  the  system  each  period: 

/  =  (/ . /*_ |)  is  the  vector  of  stock  quantities  issued  to  meet  demand  in  the  previ¬ 

ous  A  — 1  periods.  Interpret  /,  as  the  quantity  issued  exactly  /  periods  previously. 

u  —  starting  inventory  before  ordering  but  after  the  arrival  of  recycled  stock  in  the 
current  period. 

r  =  inventory  on  hand  after  ordering  and  after  returns  in  the  current  period. 

The  state  of  the  system  at  any  point  in  time  is  described  by  the  vector  iuj ),  and  the  decision 
variable  (the  order  quantity),  is  given  by  z  —  u. 

We  will  adopt  the  common  conventions  that  the  holding  cost  function  hi-)  and  the  shor¬ 
tage  cost  function  pi-)  are  convex  functions  of  ending  stock  in  each  period.  The  outdating  cost 
is  assumed  to  be  0  per  unit  of  stock  that  outdates  at  the  end  of  each  period,  and  the  procure¬ 
ment  cost  is  c  per  unit.  If  follows  that  the  one  period  expected  holding,  shortage,  and  outdate 
cost  function,  say  L  (z),  is  a  convex  function  of  the  starting  stock  z,  and  is  given  by 

Liz)  =  E[h[iz  -  Z»+]  +  p[(D  -  z)+]  +  0(1  -  /3)  (z  -  Z))+|, 

=  J*o  hiz  -  t)fit)dt  +  f  pit  -  z)fit)dt  +  0(1  -  /3)J*0  (z  -  t)fit)dt. 

Assuming  that  future  costs  are  discounted  by  a  where  0  <  a  <  1,  it  follows  that  the 
functional  equations  defining  an  optimal  policy  are  given  by: 


i 


■joaaa 
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Cn(u,l )  -  min 


c(z  -  u)  +  L(z) 


+  «/0  Cn+1  (/3<z  -  /)  +  n/x_,,r,/ i . IK-2)f(t)dt 

+  “X  c*+i  (a/x_„z,/i . /x-j)/(/)^| 


for  n  >  1  and  Cy+|  (•)  =  0.  Note  that  this  is  equivalent  to  assuming  that  stock  remaining  at  the 
end  of  the  horizon  is  not  salvaged.  Other  salvage  assumptions  are  possible  and  will  be  con¬ 
sidered  in  the  next  section. 


Here  C„  (u,I)  has  the  interpretation  as  the  minimum  expected  discounted  cost  at  the  start 
of  period  n  when  u  is  the  starting  stock  after  returns,  and_/  is  the  vector  of  previously  issued 
stock. 


The  process  dynamics  are  implied  in  the  functional  equations  above.  Let  t  be  the  realiza¬ 
tion  of  demand  in  period  n.  Then  there  are  two  cases: 


(a)  /  <  z.  In  this  case  (  -  /3)  (z  -  t)  is  lost  due  to  decay  and  /3(z  -  r)  transfers  to  the 
next  period  which  combines  with  the  stock  which  recycles  in  period  n  +  I,  a/x_ j. 
Exactly  t  units  are  issued  to  meet  the  demand.  In  this  case  if  (u,l)  is  the  state  vector 

in  period  n,  it  follows  that  (fi (z  —  t)  +  aIK_\,t,l\ . /x_2)  is  the  state  vector  in 

period  n  +  1. 

(b)  t  >  z.  In  this  case  ending  stock  in  period  n  is  zero  and  no  stock  decays  or  is 

transferred  to  the  following  period.  Starting  stock  the  next  period  consists  only  of 
the  stock  which  recycles  in  period  n  +  1,  which  is  a/x_ Since  only  z  can  be  issued 
to  meet  demand,  the  state  vector  one  period  hence  is  (a/x_i,z,/| . /x_2). 

The  optimal  policy  is  to  order  the  max  ( z„  (/  )  -  u,  0)  where  z„(/),  the  order  to  point, 
minimizes  the  bracketed  term  on  the  right  hand  side  of  the  functional  equation  above.  Compu¬ 
tation  of  an  optimal  policy  will  be  difficult  due  to  the  limitations  of  dynamic  programming  with 
vector  valued  state  variables.  However,  for  A  —  1  under  reasonably  general  conditions  the 
optimal  policy  can  be  shown  to  reduce  to  a  single  critical  number  in  each  period.  In  addition,  a 
critical  number  approximation  is  derived  for  the  case  A  >  1. 

A  MYOPIC  CRITICAL  NUMBER  APPROXIMATION 

We  will  assume  as  above,  that  periods  are  numbered  forwards  and  the  planning  horizon  is 
exactly  T  periods  where  A  <  T  <  +<».  We  ignore  the  case  T  <  A,  as  the  feedback  process 
will  not  be  relevant,  and  the  optimal  policy  reduces  to  the  ordinary  critical  number  order  policy. 
The  variables  u„  and  z„  are  still  to  be  interpreted  as  starting  stock  after  returns  before  and  after 

ordering  respectively  in  period  n.  In  addition,  let  ( /( 1 ) . /(A))  represent  stock  issued  in  the 

final  A  periods.  That  is,  / ( 1 )  is  issued  in  period  T  —  A  +  1,  / (2)  is  issued  in  period  T  -  A  +  2, 

. . .  ,  and  /(A)  is  issued  in  period  T. 

In  order  to  construct  the  approximation  we  will  need  to  assume  that  all  stock  remaining  in 
the  system  at  the  end  of  the  horizon  can  be  salvaged  at  a  return  equal  to  the  purchase  cost  of  c 
per  unit.  This  includes  stock  on  hand  at  that  time,  (wr+  j ) ,  and  the  stock  issued  in  the  final  A 

periods  of  the  horizon  l(\) . /(A).  In  addition,  we  assume  that  the  issued  stock  cannot  be 

salvaged  until  it  returns  to  inventory.  Hence  al(  1)  is  salvaged  in  period  T  +  1,  o/(2)  in  period 
T  +  2,  ...  ,  and  a/ (A)  in  period  T  +  A.  (The  salvage  assumption  was  first  used  by  Veinott 
(51.) 
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With  these  assumptions,  the  total  discounted  cost  over  T  periods,  say  TC(z),  when  fol¬ 
lowing  the  ordering  policy  z  —  (zlt. . .  ,zT)  is 


TC(z )  -  £  a"  '  ~  -  <*TcuT+ !  -  c  £  a 


n  - 1 


„T+k- 1 


al(k). 


k- 1 


The  process  dynamics  imply  that 

P(z„-\  -  Dn-\)+  for  2  <  n  ^  X. 

”  /3(z„_i  -  Dn-i)+  +  a  min  (zfl_x,D„_x)  for  A.  +  1  <  n  <  T  +  1 
and  l(k)  =  min(zr-x+*,£r-x+*)  for  1  <  k  ^  A. 

By  a  rearrangement  of  terms,  one  can  show  that 
TC(z)~  £  a”-'W{z„)-  cut. 

n- 1 

where 

^(zj  ”  E[c(zn  -  a/3 (z„  -  D„)+  -  otK  a  min  (z„  ,£>„)]  +  L(z„)  for  1  <  T. 


We  have  the  following  result: 
THEOREM  1:  Assuming 


1.  All  inventory  on  hand  in  periods  T  +  1 . T  +  A  can  be  salvaged  in  that  period 

at  a  return  of  c  per  unit. 

2.  p'(0)  >  (1  -  aAa)  c 

3-  3* 

4.  u  |  <  z*. 


a  +<3  -  1 


zT-  1  far  1  <  n  <  T 


where  z*  is  the  minimizing  point  of  W(z)  and  is  the  root  of  the  equation  W'{z*)  —  0,  then  the 
optimal  policy  is  to  order  to  z*  every  period. 

T  T 

PROOF:  Since  £  (z*)  <  £  W„(z),  it  follows  that  z*is  the  optimal  order  to  point  if 

n-l  n- 1 

z*  can  be  achieved.  It  will  be  possible  to  order  to  z*  in  period  n  if  and  only  if  z*  -  u„  >  0. 
Following  the  policy  z*  in  every  period  implies  that 

u„  -  /3(z*  -  D„-\)+  +  a  min  (z*,A,_x) 

^  /3(z*  -  Dn-\)+  +  az*  -  max(az*,  (a  +  /3)  z*  - 

Clearly  az *  <  z*.  By  assumption  3,  (a  + /3)  z*  -  /3Z>„_!  <  (a  + /3)  z*  —  (a  + /3  -  1) 
z*  —  z*,  hence,  u„  <  z*. 


Since  IF(z)  is  convex  in  z,  and  by  assumption  2,  IF'(O)  <  0,  we  have  that  z*  >  0. 

□ 
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COROLLARY:  For  A  -  1,  P 


D.  > 


a  +  P  - 


P 


1  for  1  <  n  <  T,  and  hence 


the  myopic  policy  defined  in  Theorem  1  is  optimal  when  only  assumptions  (1),  (2)  and  (4) 
hold. 


PROOF:  For  A  =»  1  we  have, 

un  =  p(z*  -  £>„_ i)+  +  a  min(z*,  />„_]), 
and  it  follows  easily  that  u„  <  z*  for  all  realizations  of  £)„_]. 


□ 


When  all  costs  are  linear,  z*  will  be  given  by. 


(1) 


F~l 


_ p  -  c(l  —  axg) _ 

p  +  h  +  0(\  —  p)  —  ac(p  —  ax~’a) 


Assumption  3  in  Theorem  2  is  somewhat  tautological  when  A  >  2,  since  z*  depends  on 
the  distribution  of  D„ .  When  this  assumption  does  not  hold,  it  may  not  be  possible  to  order  to 
z*  in  every  period  as  it  will  not  necessarily  be  true  that  u„  $  z*.  However,  since  the  expected 
cost  function,  W(z),  tends  to  be  relatively  flat  in  a  neighborhood  of  the  minimum,  it  seems 
reasonable  to  conjecture  that  when  assumption  3  is  not  met,  ordering  (z*  -  u„)+  in  every 
period  should  give  a  good  approximation. 

In  order  to  test  this  conjecture,  numerical  computations  are  performed  for  A  =  2  in  the 
next  section.  Dynamic  programming  is  used  to  compute  the  optimal  stationary  policy  which  is 
then  compared  to  z*  for  a  variety  of  demand  distributions  and  cost  configurations. 

NUMERICAL  COMPARISONS  FOR  A  =  2 

A  series  of  runs  were  carried  out  for  a  variety  of  configurations  of  the  system  parameters 
to  compare  the  effectiveness  of  the  approximation  to  the  optimal  policy  when  the  recycle  delay 
was  two  periods.  In  order  to  reduce  the  number  of  different  factors  considered,  the  cost  param¬ 
eters  (c,h,p)  are  combined  into  the  single  constant  m  =  (p  -  c)/(p  +  h)  (which  is  motivated 
by  the  solution  to  the  newsboy  problem).  For  each  demand  distribution  the  following  factors 
and  levels  are  considered: 


(1) 

cost  ratio, 

m  €  |.5,  .75,  .95) 

(2) 

return  fraction, 

a  €  {.2,  .5,  .8) 

(3) 

outdate  cost. 

9  €  (1,  2) 

(4) 

outdate  fraction. 

P  €  1.2,  .8) 

These  factor  levels  lead  to  a  36  case  experiment.  The  discount  factor  a  was  fixed  at  .95, 
order  cost  c  at  1  and  holding  cost  h  at  .5.  The  required  values  of  m  were  achieved  by  setting  p 
at  2.5,  5.5  and  28.5,  respectively. 

The  total  36  case  experiment  was  run,  for  uniform,  exponential  and  geometric  distribu¬ 
tions  each  with  a  mean  value  of  five  which  resulted  in  a  total  of  108  cases.  The  output  for  a 
typical  case  is  illustrated  in  Table  1.  The  optimal  solution  was  computed  by  standard  value 
iteration  techniques  and  the  myopic  policy  was  computed  from  0).  A  30  period  horizon  was 
selected  to  minimize  transient  effects.  Convergence  to  the  stationary  optimal  policy  generally 
occurred  in  ten  periods  or  less.  We  note,  from  Table  I,  that  both  the  optimal  solution  and  cost 
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penalty  for  using  the  myopic  approximation  are  relatively  insensitive  to  changes  in  the  initial 
inventory  level.  The  maximum  percent  cost  penalty  for  using  the  myopic  policy  is  0.3%  in  this 
case. 


TABLE  l— Optimal  and  Approximate  Policies  for  the  Case 
m  *  .5,  a  “  .2,  0  “  1. 1 3  ”  .8,  Poisson  Demand 


Initial 

Inventory 

I 

Optimal 

Order 

Function 

(z*(/)) 

Myopic 
Order-up 
-to  Level 
(z*) 

Average  Cost 
per  Period 

%  Difference 
in  Cost 

using  z*(7) 

using  z* 

0 

6 

5 

3.232 

3.240 

0.2 

1 

6 

5 

3.227 

3.237 

0.3 

2 

6 

5 

3.225 

3.236 

0.3 

3 

5 

5 

3.211 

3.218 

0.2 

4 

5 

5 

3.205 

3.212 

0.2 

5 

6 

5 

3.200 

3.208 

0.3 

6 

6 

5 

3.196 

3.205 

0.3 

7 

6 

5 

3.193 

1.204 

0.3 

8 

5 

5 

3.179 

3.186 

0.2 

9 

5 

5 

3.173 

3.181 

0.3 

10 

6 

5 

3.169 

3.176 

0.2 

Table  II  summarizes  36  runs  selected  from  the  set  of  108  runs.  The  runs  illustrated  were 
selected  by  taking  the  worst  case  (that  is,  the  largest  cost  error)  over  the  three  demand  distribu¬ 
tions  for  each  case.  Optimal  and  myopic  policies  and  costs  for  just  the  single  initial  inventory 
level  of  five  are  indicated,  since,  as  noted  above,  the  results  are  not  sensitive  to  the  initial 
inventory  level.  Table  II  also  indicates  the  maximum  percent  cost  differences  for  each  case 
taken  over  all  initial  inventory  level  values.  Note  that  the  maximum  percent  cost  penalty 
ranges  from  0.0%  to  6.9%  over  all  factor  values.  In  addition,  also  note  that  71%  of  all  108  cases 
had  a  maximum  cost  difference  of  less  than  1%  and  that  only  6.5%  had  a  maximum  cost 
difference  of  more  than  5%. 

It  seems  reasonable  to  conjecture  that  the  myopic  policy  will  also  provide  a  good  approxi¬ 
mation  for  values  of  A,  the  recycle  delay  parameter,  larger  than  two  as  well.  The  approximation 
has  the  dual  advantage  of  being  both  easy  to  compute  and  easy  to  implement.  The  model 
presented  here  is  applicable  to  a  variety  of  inventory  problems  where  stock  recycling  is  present, 
including  blood  bank  inventory  control  and  reparable  item  management. 
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TABLE  II—  Worst  Case  Optimal  and  Myopic  Cost  Comparisons 


Run 

# 

m 

a 

— 

theta 

beta 

Demand 

CDF 

Optimal 
r*  (5) 

Optimal 
Average 
Cost  at 
/  -  5 

Approx. 

Policy 

r* 

Average 
Cost  of 
Approx. 
Policy 

Maximum 
%  Penalty 

1 

.50 

.2 

l 

.2 

Geometric 

2 

3.372 

1 

3.501 

4.0 

2 

.75 

.2 

1 

.2 

Poisson 

6 

4.802 

5 

5.026 

4.7 

3 

.95 

.2 

I 

.2 

Uniform 

10 

7.658 

9 

7.969 

4.1 

4 

.50 

.5 

1 

.2 

Poisson 

5 

3.041 

4 

3.198 

5.2 

5 

.75 

.5 

1 

.2 

Uniform 

7 

5.542 

7 

5  542 

0.0 

6 

.95 

.5 

1 

.2 

Uniform 

10 

6.890 

9 

7.21 

4.7 

7 

.50 

.8 

1 

.2 

Poisson 

5 

2.566 

4 

2.745 

6.9 

8 

.75 

.8 

1 

.2 

Uniform 

7 

4.932 

7 

4.932 

0.0 

9 

.95 

.8 

l 

.2 

Uniform 

10 

6.188 

9 

6.52S 

5.5 

10 

.50 

.2 

2 

.2 

Geometric 

1 

3.589 

1 

3.593 

.1 

11 

.75 

.2 

2 

.2 

Poisson 

5 

5.394 

5 

5.394 

0.0 

12 

.95 

.2 

2 

.2 

Poisson 

8 

7.980 

7 

8.403 

5.3 

13 

.50 

.5 

2 

.2 

Uniform 

3 

4.649 

3 

4.649 

0.0 

14 

.75 

.5 

2 

.2 

Uniform 

7 

6.603 

6 

6.639 

0.1 

15 

.95 

.5 

2 

.2 

Poisson 

8 

7.236 

7 

7.655 

5.9 

16 

.50 

.8 

2 

.2 

Geometric 

2 

3.095 

1 

3.188 

3.7 

17 

.75 

8 

2 

.2 

Poisson 

6 

4.101 

5 

4.193 

2.2 

18 

.95 

.8 

2 

2 

Poisson 

8 

6.557 

7 

7.003 

6.7 

19 

.50 

.2 

1 

.8 

Poisson 

6 

3.200 

5 

3.208 

0.4 

20 

.75 

.2 

1 

.8 

Geometric 

6 

3.652 

5 

3.728 

2.1 

21 

.95 

.2 

1 

.8 

Geometric 

9 

4.608 

9 

4.609 

0.0 

22 

.50 

.5 

1 

.8 

Uniform 

7 

3.161 

6 

3.217 

1.9 

23 

75 

•5 

.8 

Uniform 

9 

3.666 

8 

3.696 

0.8 

24 

.95 

.5 

.8 

Poisson 

9 

3.655 

9 

3.656 

0.0 

25 

.50 

.8 

1 

8 

Geometric 

3 

2.162 

4 

2.185 

2.6 

26 

.75 

.8 

1 

.8 

Geometric 

6 

2.903 

6 

2.926 

1.4 

27 

.95 

.8 

1 

8 

Geometric 

9 

3.883 

9 

3.885 

0.2 

28 

50 

.2 

2 

.8 

Uniform 

6 

4.090 

6 

4.093 

0.1 

29 

.75 

.2 

2 

.8 

Poisson 

7 

3.944 

6 

4.030 

2.3 

30 

.95 

.2 

2 

.8 

Poisson 

9 

4.828 

8 

5.060 

4.8 

31 

.50 

.5 

2 

.8 

Uniform 

6 

3.417 

6 

3.427 

0.4 

32 

.75 

.5 

2 

.8 

Poisson 

7 

3.208 

6 

3.323 

3.6 

33 

.95 

.5 

2 

.8 

Poisson 

9 

4.086 

8 

4312 

5.5 

34 

50 

.8 

2 

8 

Poisson 

6 

2.070 

5 

2.092 

1.3 

35 

.75 

8 

2 

.8 

Poisson 

7 

2.629 

6 

2.699 

2.1 

36 

.95 

2 

.8 

Poisson 

9 

3.499 

8 

3.639 

3.9 

REFERENCES 

tl)  Allen,  S.G.  and  D  A.  D’Esopo,  "An  Ordering  Policy  for  Repairable  Stock  Items,"  Opera¬ 
tions  Research,  16,  669-675  (1968). 

(2)  Cohen,  M.A.  and  W.P.  Pierskalla,  "Management  Policies  for  a  Regional  Blood  Bank," 
Transfusion,  15,  58-67  (1975). 

[31  Cohen,  M.A.,  S.  Nahmias  and  W.P.  Pierskalla,  "A  Feedback  Inventory  Model  of  a  Hospital 
Blood  Bank,"  Proceedings  of  the  Seventh  Annual  Pittsburgh  Modelling  and  Simulation 
Conference,  Instrument  Society  of  America,  612-616  (1976). 

[41  Prawda,  J.  and  G.P.  Wright,  "On  a  Replacement  Problem,"  Cahiers  du  Centre  d'Etudes  de 
Recherche  Operationnelle,  14,  43-52  (1972). 

[51  Veinott,  A.F.,  Jr.,  "Optimal  Policy  for  a  Multi-procduct,  Dynamic,  Non-stationary  Invento¬ 
ry  Problem,"  Management  Science,  12,  206-222  (1965). 


SENSITIVITY  ANALYSIS  AS  A  MEANS  OF  REDUCING 
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ABSTRACT 

Sensitivity  analysis  of  the  transportation  problem  is  developed  in  a  way 
which  enables  reducing  the  dimensionality  of  the  associated  tableau.  This  tech¬ 
nique  is  used  to  reduce  the  dimensionality  of  a  transportation  problem  whose 
origin  requirements  are  relatively  small  at  the  majority  of  origins.  A  long  trans¬ 
portation  problem,  for  which  efficient  solution  procedures  exist,  results.  A 
second  application  relates  to  the  location-allocation  problem.  Reducing  the 
dimensionality  of  such  a  problem,  accompanied  by  the  partial  determination  of 
the  optimal  solution,  should  prove  helpful  in  the  quest  for  an  analytic  solution 
to  the  aforementioned  problem.  In  the  meantime,  reducing  dimensionality 
greatly  decreases  the  effort  involved  in  solution  by  trial  and  error.  Examples  of 
the  two  applications  are  provided. 


1.  INTRODUCTION 

The  sensitivity  analysis  of  the  transportation  problem  has  been  thoroughly  developed  in 
the  classical  papers  of  Srinivasan  and  Thompson  [7],  The  latter  investigators  described  how  to 
find  the  optimal  solution  of  a  transformed  problem,  given  the  optimal  solution  of  the  original 
problem.  Finding  the  new  solution  is  somewhat  easier  when  the  optimum  basis  structure  is 
preserved.  Otherwise,  one  proceeds  from  basis  to  basis  until  arriving  at  the  optimal  basis. 

The  present  paper  takes  a  different  approach  to  sensitivity  analysis.  Rather  than  obtaining 
specific  values  for  all  members  of  the  optimal  solution,  we  determine  the  values  of  selected 
variables  in  a  way  which  enables  reducing  the  dimensionality  of  the  transportation  tableau.  The 
greater  the  similarity  between  the  original  and  transformed  problems,  the  more  the  tableau  may 
be  reduced  in  size.  Larger  transformations  do  not  permit  as  large  a  reduction  in  tableau  size, 
leading  to  a  greater  expenditure  of  time  in  the  solution  of  the  resulting  tableau. 

The  aforementioned  technique  has  been  described  exhaustively  in  a  paper  by  Intrator  and 
Paroush  [5].  It  is  not  the  purpose  of  the  present  work  to  compare  these  two  methods  of  per¬ 
forming  sensitivity  analysis  as  far  as  efficiency  and  ease  of  implementation  are  concerned, 
although  the  authors  feel  that  doing  so  would  indeed  be  a  worthwhile  task.  Rather,  a  single 
facet  of  the  new  approach  is  enlarged  upon,  and  how  this  aspect  may  be  utilized  in  speeding  up 
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the  solution  of  two  problems  not  generally  associated  with  sensitivity  analysis  is  explained. 
More  important  than  the  savings  in  computer  time,  however,  is  the  fact  that  reducing  the  size 
of  the  tableau  in  the  case  of  the  location-allocation  problem  may  serve  as  a  stepping-stone  in 
the  eventual  determination  of  an  analytic  solution  for  the  latter  problem. 

2.  DEFINITIONS  AND  THEOREMS 

Consider  a  transportation  problem  A  having  cost  matrix  C  =  (c^)  and  optimal  solution 
(x-y),  and  a  second  problem  A'  identical  to  the  first  in  all  respects  except  for  the  fact  that  the 
cost  matrix  has  been  transformed  to  C'  —  (cj),  leading  to  a  new  solution  (x 

Consider  the  simple  loop 

L  =  (i\Ji)ViJ2)02J2). .  .(ikJk+\) 
for  which  jk+i  —  yj.  Then  the  following  quantities  may  be  defined: 

^  (~L  =  CV|  ~  ci,j2  +  ci2j2  ~  ci2j3  ~  cikjk  + 1  • 


(2) 


Q  c/,y ,  citj2  +  ci2J2  ci2j)  cV4+i^ 


The  8 L  transformation  changes  any  feasible  solution  of  the  original  problem,  (y,y),  to  a 
new  solution  (y,j)  by  defining  the  members  of  (yj)  as  follows: 


(3) 


yv- 


y0  +  8,  if  U,j)  =  (i/Ji),  i.e.,  if  (i,j)is 

an  odd-numbered  member  of  L. 
ytj  -  8,  if  (ij)  =  (ii.ji+i).  i.e.,  if  (i,j)is 

an  even-numbered  member  of  L. 
yijt  if  (i,j)  is  not  a  member  of  L. 


Before  executing  the  8Z.  transformation,  both  the  loop  L  and  the  real  number  8  must  be  deter¬ 
mined.  If  they  are  chosen  such  that  (yj)  is  also  a  feasible  solution,  then  8L  is  termed  a  feasi¬ 
ble  transformation. 


For  a  given  loop  L  and  8  <  0,  the  8 L  transformation  will  be  feasible  if  the  odd-numbered 
cells  of  the  loop  are  all  positive  (basic)— a  requirement  that  ensures  that  ytJ  +  8  will  not 
become  negative.  In  particular,  it  is  necessary  that  -8  <  min  (Eq.  (3)).  If  8  >  0,  the 

transformation  is  feasible  if  the  even  numbered  cells  are  positive— a  requirement  that  ensures 
that  y:j  —  8  will  not  become  negative.  In  particular,  it  is  necessary  that  8  <  y,i  t. 

More  concisely,  a  feasible  transformation  is  one  for  which 


(4) 


-  min  yih 


<  8  < 


mm 


Letting  z  and  z*  represent  the  value  of  the  objective  function  before  and  after  the 
transformation,  respectively,  then 


+  Il  may  be  assumed  without  loss  of  generality  that  both  Q  and  0/  never  equal  zero,  since  it  is  always  possible  to  ap¬ 
propriately  perturb  the  costs  c„  and  r„. 
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z*  -  r  =  II c„y'  -  II c„y„  =  £  c’,u  O'.,/,  +  s> 


+  Z  c.,/,.10'.,i,H - 8)  +  ZZ^.t.. -  II 

K/s$A  (/./)?/.  /  / 


(5) 


-« I  c‘,j, - 8 1 


UK* 


W/+1 


=  8  ■  Q. 


If  8Q  <  0,  the  objective  function  has  been  reduced  and  8L  is  called  an  improving 
transformation.  If  8Q  >  0,  81  is  called  an  impairing  transformation. 

LEMMA  1:  The  solution  (xj)  of  the  transformed  problem  A  'may  be  obtained  from  the 
solution  Or,,)  of  the  original  problem  A  by  performing  a  finite  sequence  of  feasible  transforma¬ 
tions.  The  transformations  are  each  improving  with  respect  to  A '  and  impairing  with  respect  to 
A. 


PROOF:  If  (x,j)  =  (x,y),  no  transformations  are  required,  and  the  theorem  is  true  in  a 
trivial  fashion.  Assume,  on  other  hand,  that  (xy)  ^  (x^).  Since  'LL*u =  Ifl- =  I  a'j  (since 

,  u  i 

the  only  changes  are  in  the  cost  matrix)  =  ZZ  *//>  follows  that  there  exists  at  least  one  cell, 

•  — ^  o  ,  , 

say  (ijjj),  such  that  x,i7|  >  x/(>  .  Since  Z-V  =  a,,  =  «/,  =  Z  x< .  there  must  exist  h  ^  J l 

j  ,  ,  J 

such  that  x,’,2  >  x,^.  Since  Z  =  bj2  =  fy,  =  Z  xu2'  there  must  ex’sl  '2  ^  '1  such  that 

/  i 

x(j>,  >  x,'2Jr  The  process  continues  until  a  previously  utilized  row  or  column  is  encountered. 
In  this  manner ,  a  loop  L,,  containing  exactly  two  cells  in  each  row  and  column,  is  obtained. 


By  construction,  for  each  odd-numbered  loop  member 


x>,h  >  x',j, 


x„/;)  <  0. 


For  each  even-numbered  loop  member 


X'iJ)*  1  <  ’’Wi 


(jW,  -  •</,♦,  >  <  0- 


In  moving  from  /l  to  A'  it  is  necessary  to  add  to  the  even-numbered  cells  of  the  loop  and  sub- 
stract  from  the  odd-numbered  cells,  an  objective  which  may  be  achieved  by  executing  a  8Z. 
transformation  with  8  <  0.  In  particular,  let 

<6>  8>  =  mfx<(xv,  -  Si)- <xv,+1  -  \;,+l )!- 

Then,  by  Eq.  (3),  is  feasible,  because  -8,  ^  min(xW(  -  x,’7/)  <  min  xl(>(. 


After  the  execution  of  81L1,  the  transformed  solution  O',*)  replaces  (x,;)  in  Eq.  (6)  and 
the  procedure  is  repeated,  leading  to  the  construction  of  a  second  loop  L 2.  Additional  loops 
Lj,  L4,  ...  Lk  are  constructed  as  necessary.  The  algorithm  terminates  when  all  of  the  variables 
have  been  appropriately  modified,  i.e.,  when  8*+)  =  0.  At  each  stage,  at  least  one  variable 
attains  its  optimal  value  in  A\  the  remaining  loop  members  approach  their  optimal  values  in  A', 
while  no  "overshoots"  may  occur. 

The  transformations  employed  are  all  feasible  with  respect  to  the  original  solution  (x„) 
(although  they  are  actually  executed  on  a  transformed  solution).  However,  a  transformation 
away  from  the  optimal  solution  must  be  an  impairing  one,  implying  8,Q  >  0,  from  which  it 
follows  that  each  transformation  is  an  impairing  one  with  respect  to  A. 
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By  the  same  token,  the  inverse  transformations  (-S ,)L;  gradually  lead  from  A' to  A  when 
executed  in  the  sequence  (-8 k)Lk,  (-8*_|)I*_|,  ...  (-8|)I|.  Since  ( x,j )  is  optimal,  these 
transformations  are  impairing  with  respect  to  A',  implying  that  the  original  transformations  are 
improving  with  respect  to  A'. 

COROLLARY:  If  =  cu  in  all  but  one  row,  say  /0,  then  each  of  the  loops  L,  must  con¬ 
tain  members  of  row  i0. 

The  corollary  follows  from  the  fact  that  the  transformation  loops  are  improving  ones  with 
respect  to  A\  i.e.,  f>,CL  <  0,  and  impairing  ones  with  respect  to  A,  i.e.,  8 ,Q  >  0,  which  can 
only  occur  if  CL  ^  C'L  ,  i.e.,  the  loops  contains  members  of  i0. 

DEFINITION:  Let  S,  =  £  (xy  -  xj),  i.e.,  S,  represents  the  sum  of  the  positive 

xi>  >  xij 

differences  between  the  optimal  values  of  the  variables  in  row  /  of  the  original  problem  A  and 
the  transformed  problem  A 

Let  S  =  max  S,,  i.e.,  S  represents  the  maximum  such  difference  among  all  the  rows. 

/ 

LEMMA  2:  If  cu  =  cj  in  all  but  one  row,  say  /0,  then  S  =  S,-  <  alQ. 

PROOF:  By  the  Corollary  to  Lemma  1,  every  loop  L,  contains  members  of  row  i0.  But 

k 

row  /0  contains  initially  only  a,-  items,  hence  Sl(.  =  Z.  8,  <  aig.  As  far  as  the  other  rows  are 

i—  1 

concerned,  p  <  Ac  of  the  loops  L,  pass  through  any  specific  row,  so  that 

S,  -  Z  <  t  8,-  < 

i-i  i-i 

Thus,  S  =  max  S,  <  a,0. 

It  will  now  be  shown  that  the  case  in  which  A  and  A  differ  in  terms  of  origin  require¬ 
ments  reduces  to  the  case  in  which  the  two  differ  in  terms  of  transportation  costs. 

THEOREM  1:  Suppose  the  two  transportation  problems  A  and  ^4 '  differ  only  with  respect 
to  their  origin  requirements.  In  particular,  suppose  that 

(a)  a,'  <  a,  (i  =  1 .  .m  -  1) 

(b)  am  >  am 

m  m 

(c)  £  a,'  =  Z  a,. 

i- 1  /- 1 

Then  xmj  >  xmj ,  where  (xy)  and  (xj)  represent  the  optimal  solutions  of  A  and  A'  respectively. 
PROOF:  Suppose,  on  the  contrary,  that  there  exists  some  column  j,  such  that  xmJl  >  xmj). 

m  m 

Since  b,:  =  £  x,,(  =  £  x'iJr  there  must  exist  at  least  one  i2  ^  /j  =  m  such  that  x,jyj  >  x,^r 

i-i  /-i 

However,  from 

n  n 

=  Z  <  a,;  -  Z 

/-I  ‘  ’  7-1 
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it  follows  that  there  must  be  some  j2  ^  j\  such  that  >  x,'2j .  For  the  new  j2  there  must  be 
a  new  i3.  The  process  continues  until  one  returns  to  a  row  or  column  encountered  previously 
in  the  process,  and  a  loop,  say  L,  containing  k  cells  is  obtained. 

Regarding  loop  L  there  exist  three  possibilities: 

(a)  CL  =  0.  Perturbation  eliminates  this  possibility. 

(b)  CL  >  0.  Let  8  =  min  x. ,  >  min  x' ,  >  0.  Then  the  (-8)1  transformation  per- 
formed  on  A  is  an  improving  one,  since  -8  CL  <  0,  a  result  which  contradicts  the  optimality  of 
Of*/)- 

(c)  C,  <  0.  Let  8  =  min  x, .  >  min  x  .  >  0.  Then  the  8  L  transformation  per- 

formed  on  A'  is  an  improving  one,  since  SCL  <  0,  a  result  which  contradicts  the  optimality  of 

(xtj). 

The  initial  assumption  that  there  exists  a  variable  xmj]  >  x'mj  must  therefore  be  dropped 
in  favor  of  the  conclusion  that  xmj]  <  xmj  ( .  (Note  that  it  follows  from  Theorem  1  that  xmj  =  0 
implies  xmj  =  0  while  xmj  >  0  implies  x'mj  >  0  (1  <  j  <  n)  when  the  stipulated  conditions 
hold.) 

THEOREM  2:  Suppose  that  A  and  A"  differ  in  the  following  respects  as  regards  their  ori¬ 
gin  requirements: 

a[  -  a{+  a,  a'm*=  am  -  a ,  a-  —  a, 0  =  2,m  -  1). 

Then  Si  =  0,  Sm  =  a,  S,  <  a,  2  <  /  <  m  —  1,  i.e.,  S  —  a. 

PROOF:  By  Theorem  1,  considering  a\  to  be  am,  it  follows  that  x[2  >  x^-G  <  j  <  n), 
which  implies  that  S{  =  (x{j  -  x[j)  =  0. 

x\j>x'\j 

Also,  looking  upon  the  new  problem  as  the  original  and  vice-versa,  it  follows  from 
Theorem  1  that  x'mj  <  xmj,  which  implies  that 

n  t 

Sm  X  Xmj )  Qm  am  (X . 

J- 1 


In  order  to  prove  that  S,  <  o(l  <  i  <  m),  it  will  be  demonstrated  that  A  and  A'  are 
equivalent  to  two  transportation  problems  which  differ  from  each  other  with  respect  to  the 
transportation  costs  in  a  single  row  exclusively. 

Consider  the  auxiliary  (m  +  1)  x  n  transportation  problem  A  having  the  following  param¬ 
eters: 

a,  =  a,,  i  ~  1,  2,  . . .  m  -  1,  am  -  am  -  a, 

Om  +  I  "  «. 

c,j  -  c,j,  bj  -  bj  (1  <  /  <  m,  l  <  j  <  n), 

Tm+\.j  =  1^2^ 
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Let  A'  be  defined  identically,  except  that  cm+iJ  -  Ciy,  1  <  j  <  n.  Now ,  note  that  A  is 

equivalent  to  the  original  problem  A  with  xmj  -  xmj  +  xm+ij,  _ since  l(xmj  +  xm+lJ)  — 

(am  -  a)  +  a  -  am  (see  [3],  pp.  319-320).  On  the  other  hand,  A[  is  equivalent  to  A'  with 

x\j  "  X\j  +  xm+u,  since  I(x1;  4-  xm+Xj)  =  +  a.  Since  /4  and  A '  are  identical  except  for 

differing  transportation  costs  in  row  (m  +  1)  for  which  am+l  -  a,  it  follows  from  Lemma  2  that 
Si  ^  S  =  Sm+|  ^  a. 


THEOREM  3:  Suppose  A  and  A'  differ  as  follows  with  respect  to  their  origin  require¬ 
ments: 

(a)  a'„  =  ap  -  yp  (1  <  p  <  r,  0  <  yp  <  ap), 

(b)  as'  =  as  +  05  (r  <  s  <  m,  f3s  >  0), 

(c)  ip,  =  lyp. 

Then 

5  =  ZPs  “  I  («,'  -  as) 

and 

5,  <  L0,- 10,-0,  -  £  («;-«,). 

PROOF:  The  transition  from  /I  to  may  be  broken  down  into  a  finite  sequence  of  steps 
of  the  type  considered  in  Theorem  2.  The  process  may  be  described  as  follows.  Suppose  min 

{ yp,yp  >  0)  =  yP{  and  min  {ps,/3s  >  0}  =  0Si.  Let  a{  =  min  (yP|,0Sl).  Then  consider  the 

transformation  =  ap,  -  <*,,  af(l)  —  aSi  +  a,.  By  Theorem  2, 

S,  =  0,  i  =  s( 

5,  =  a,,  i  =  /?! 

5,  ^  of i »  i  5^  /?!,  Sj, 

5  =  a  (. 

Note  that  either  a^11  =  or  a^1 1  =  aP]  (or  both).  The  remaining  values  of  a,  are  unaffected. 
In  other  words,  one  of  the  requested  changes  has  been  executed,  a  second  has  been  (at  least) 
partially  implemented,  and  no  "overshoots"  have  occurred. 


The  second  step  involves  choosing  the  next  origin  requirement  to  be  transformed  to  a',. 
First,  one  sets  yPl  =  yP]  —  a\  and/3S|  =  /3S|  -  a)  (at  least  one  of  which  will  be  0),  and  then  the 
previously  described  algorithm  is  executed.  The  process  terminates  after  k  steps  when  all  yp 
and  /3S  equal  zero.  Applying  Theorem  2  to  each  of  the  k  stages,  one  finds  that 


k 

Z  ah 

h-\ 


I  Ps 

f—  r+1 


(since  each  /3S  has  been  reduced  to  0).  For  r  <  i  <  m,  St  =  0  for  every  transformation  which 
leads  to  an  increase  in  the  value  of  ar  Such  transformations  involve  a  total  increase  of  (3,. 
Taking  all  transformations  into  account  (even  those  which  do  not  involve  a,,  for  which 

k  m  m 

Si  <  ah),  one  finds  that  S’,  <  £  ah  -  /3,  =  £  ps  -  (3,  =  £  /3S.  The  latter  applies  when 

I  s-r+1  S“f  + 1 


i  <  r  as  well,  since  s  cannot  equal  /  in  such  a  case. 


z 
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DEFINITION:  Let  Dik(J)  =  ctJ  -  ekj,  i.e.,  A* O’)  equals  the  difference  in  costs  between 
the  cells  of  column  j  associated  with  rows  i  and  k,  respectively.  Since  it  is  always  possible  to 
introduce  cost  perturbation,  it  may  be  assumed  that  Dlk(J{)  =  Dik(j2 )  if  and  only  if  y,  =  y2. 

LEMMA  3:  If  xkv  >  0  and  x,M  >  0,  then  Dik(v)  >  Dik(fi),  where  (xu)  represents  the 
optimal  solution  of  transportation  problem  A. 

PROOF:  Consider  the  simple  loop  L  =  (k,  v);  (i>);  (i.fi).  Letting  8  equal  the 

minimum  of  the  even-numbered  cells  (i.e.,  8  =  min(jr*„,x,M)  >  0  by  hypothesis)  ensures  that 
the  8  L  transformation  is  feasible.  However,  (x„)  is  the  optimal  solution,  so  that  8 L  cannot  be 
an  improving  transformation;  i.e.,  of  necessity  8Cl  =  &(ck/J  -  ck„  +  c,„  -  c,M)  =  8[£>,*(")  - 
A*(m)1  >  0,  which  implies  that  Dik(»)  >  D,k(p.),  as  was  to  be  proven.  See  Fig.  1  for  a  sche¬ 
matic  representation. 


Dlk(p) 


Figure  I.  The  relative  positions  of  D,*(/x)  and  Dlk(v)  on  a  scale  whose  uppermost  ele¬ 
ment  corresponds  to  the  smallest  value  of  D,k(j)  and  whose  lowermost  element 
corresponds  to  the  largest  value  of  Dlk(j)  when  it  is  known  that  xk„  >  0  and  x,M  >  0. 


COROLLARY:  If  Dik(v)  <  Dik(ji),  it  is  impossible  that  both  xkl,  >0  and  xitl  >  0. 

DEFINITION:  Let  A ,  represent  the  set  of  columns  associated  with  the  optimal  group  of 
basic  cells  in  row  /  of  problem  A,  i.e.,  At  =  [j  |xy  >  0).  The  n,  elements  of  A,  may  be  arranged 
as  the  vector  Kik  =  (/],.  .  ,i„)  where  Dik(i\)  <  D,k (i2)  <  ...  <  Dik(i„),  i.e.,  each  basic  column  j 
is  ordered  in  terms  of  its  corresponding  Dik(j)  value.  The  integer  s(i,k )  will  be  determined  by 
the  following  inequality: 

(7)  £  x„  >  S,  >  £  *«  • 

It— s(i,k)  fi—s(i,k)+\ 

For  an  example  of  the  above  notation,  see  Fig.  2. 

Consider  the  transportation  problem  A'  which  differs  from  A  with  respect  to  origin 
requirements  exclusively.  Although  the  optimal  solution  0 cj)  differs  from  (x,y),  the  A*  O’) 
values  are  unaltered,  since  the  transportation  costs  have  not  changed. 

THEOREM  4:  If  v  <  s(i,k),  then  x'ki^  =  0(/„  €  Kik). 

PROOF:  Suppose  xki  >  0.  Then,  by  the  Corollary  to  Lemma  3,  for  all  v  <  n  <  n,  (for 
which  Dlk(iJ  <  D^U^)),  x‘u  -  0.  Since  €  A,  x„  >  0.  It  follows,  then,  that  x,,^  >  x,]  for 
all  v  <  s(i,k)  <  n  <  rti.  Then 
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K 


ik 


'1 


3 


4 


Figure  2.  In  the  above  figure,  s(i,k)  —  4  and  n,  -  7, 
where  it,  i2,  ...  i2  are  the  elements  of  A,. 


m-J 


Z  \  =  Z  ^  Z  (Xij-  xlj)  =  sr 


(/',*) 


n-sU.k ) 


In  short,  X  ^  S,.  But  by  definition  ^  x„  >  S,,  which  is  in  contradiction  to  the 

n-s(i.k)  M  li^sU.k)  **  , 

result  presently  obtained.  Accordingly,  the  initial  assumption  that  x*,  >  0  must  be  rejected. 


The  application  of  Theorems  3  and  4  is  as  follows.  One  first  calculates  s(i,k)  for  each  of 
the  m(m  -  1)  permutations  of  i  and  k,  a  task  which  requires  knowledge  concerning 
Sh  i  =  1,  . . .  w.  Theorem  3  enables  the  determination  of  an  upper  bound  on  5,,  which  in  turn 
provides  a  lower  bound  on  s(i,k ),  as  follows  from  Eq.  (7).  Utilizing  Theorem  4,  a  large 
number  of  variables  x'kj  are  set  to  0.  Whenever  all  but  one  of  the  variables  in  a  given  row  or 
column  equal  0,  the  remaining  variable  equals  a,  or  bj  as  the  case  may  be,  and  the  respective 
row  or  column  may  be  eliminated  from  the  tableau. 


3.  APPLICATIONS 


Reducing  the  Dimensionality  of  a  Certain  Class  of  Transportation  Problems 

m 

Consider  an  m  x  n  transportation  problem  A  for  which  =  R.  Furthermore, 

*-i  y- i 

m  1  m 

suppose  that  ^a,~aR  and  Z  a,  -  (1  -  <*)R.  The  present  application  concerns  the  case  in 


/-i 


i—m  |  +  l 


which  m\  <  <  m  and  a  is  near  1,  i.e.,  the  situation  in  which  most  of  the  merchandise  is 


i 

\ 


•fi  i  liiHuMrta 
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concentrated  at  a  relatively  small  number  of  warehouses.  This  would  be  the  case  when  a  small 
number  of  factories,  at  which  large  quantities  may  be  stored,  supply  a  larger  number  of 
intermediary  warehouses  of  significantly  smaller  storage  capacity. 

Consider  the  following  three  transportation  problems: 

A:  The  original  problem, 

A' :  An  m\  *  n  transportation  problem  identical  to  A  except  for  the  fact  that  the  last 

m 

m  -  m\  rows  of  A  are  deleted  and  a'm]  =  £  a,, 

/— m  | 


A":  An  m  x  n  transportation  problem  identical  to  A  except  for  the  fact  that  the  last 

m 

m  -  m\  values  of  a,  are  set  to  0  and  am[  =  £  a,.  Clearly,  an  optimal  solution  A  'also  optim- 

i»/n  | 

izes  A".  The  proposed  solution  procedure  is  as  follows: 

(i)  Solve  A'.  The  relatively  small  value  of  m  j  ensures  that  A  'will  be  a  long  transportation 
problem  for  which  an  extremely  efficient  computerized  solution  procedure  has  been  developed 
14],  The  solution  to  problem  A"  is  now  available,  and  the  previously  obtained  sensitivity 
analysis  results  will  now  be  utilized  in  finding  the  solution  to  A. 

(b)  Based  on  the  optimal  solution  to  A ",  find  the  set  of  positive  cells  in  row  /,  An  which 
serves  as  a  prerequisite  in  determining  the  ordered  vector  Kik  for  each  value  of  /,  1  <  /  <  m\ 
and  1  ^  k  ^  m,  k  ^  i,  leading  to  a  total  of  mx(m  -  1)  vectors  K,k. 

(c)  For  each  /,  1  <  /  <  mt,  determine  S,.  Note  that  A  and  A"  differ  only  with  respect  to 
their  requirements  a,.  More  specifically, 

a i  =  a",  i  =  1 ,  ...  m |  -  1 , 

am  |  =  am  |  ~  y  m , 

m  m 

(where -ymi=  £  a,,  since  a",  =  £  ai>^ 

i—m  j+1  /—  m  | 

a,  =  a,  +  /?,,  i  =  nt\  +  1,  . . .  m  (where  /3 ,  =  a,,  since  a’  =  0  over  this  range). 

m 

Since  —  £  ph  Theorem  3  is  applicable,  implying  that 

S,  ^  ]£  (a,  -  a")  —  (1  —  a)R,  i  —  1,  . . .  m,. 

i-m  |+1 

(d)  For  each  1  ^  i  <  nt\  and  1  <  k  <  m,  k  &  i,  determine  sU.k)  by  summing  the 
values  of  x,"  associated  with  the  lower  values  of  the  ladder  scale  Kik  until  they  exceed  (1  -  a)R 
(see  Eq.  7). 


will 

I 


(e)  By  Theorem  4,  for  v  <  sU.k),  xkl  =-  0(/,,  6  Klk).  Since  ^ -  a,, 

'  I  “ 

n{ 

be  useful  if  s(i,k)  >  1,  i.e.,  if  ^  x„  <  a,,  which  will 

li—stl.k)  “ 


the  theorem 
imply  that 
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/ 

**.,•  Xkiy  ...  ,|  -  0.  But  in  the  present  situation  £  x„  -  S,  <  (1  -  a)R  which 

by  assumption  is  small  with  respect  to  ah  so  that  s(i,k)  >  >  1  and  hence  a  large  number  of 
variables  xkj  will  be  set  to  zero  in  row  k. 

(0  The  mt(m  -  1)  s(i,k)  values  each  lead  to  a  set  of  xkl  values  which  are  known  to 
equal  zero  in  the  original  problem  A  Since  1  ^  k  ^  m,  such  zeroes  have  been  determined  for 
all  of  the  m  rows  of  A.  Whenever  all  but  one  of  the  variables  in  a  given  row  or  column  equal 
zero,  the  remaining  variable  equals  a,  or  6,  as  the  case  may  be  and  the  respective  row  or 
column  may  be  eliminated  from  the  tableau. 

(g)  A  second  means  of  obtaining  zeroes  is  available  for  row  m\  exclusively.  Since 

m 

a"  ^  a ,  (/'  =  1,.  ,.m,i  ^  m{)  and  a",t  =  £  a,  >  am  ,  it  follows  from  Theorem  1  that 

/  *  m  j 

x*|(  >  x„ir  Then  if  x’^  =  0,  it  follows  that  x,„  .  —  0  in  problem  A. 

(h)  Steps  (f)  and  (g)  serve  to  reduce  the  row  and  column  dimensions  of  the  remaining 
problem.  The  reduced  tableau  At  may  be  treated  in  two  ways: 

•  It  may  be  solved  as  a  routine  transportation  problem, 

•  It  may  be  solved  using  the  technique  of  Srinivasan  and  Thompson  [7],  If  A  "is  reduced 
in  size  in  a  manner  which  completely  parallels  the  reduction  in  size  of  the  original  problem  A, 
leading  to  A  then  an  optimal  solution  to  A  ’  is  available,  since  an  optimal  solution  to  the  ori¬ 
ginal  problem  A is  optimal  for  any  subtableau. 

Selecting  Additional  Warehouses  from  a  Predetermined  Set  of  Possibilities 

Consider  the  following  problem.  Given 

(a)  The  requirements  at  each  of  n  destinations, 

(b)  The  capacity  at  each  of  m\  origins, 

(c)  The  transportation  costs  between  each  of  the  m  |  origins  and  each  of  the  destinations, 

(d)  The  transportation  costs  between  each  of  (m  —  m\)  additional  (potential)  origins  and 
each  of  the  given  destinations, 

m 

(e)  The  total  additional  capacity  desired,  i.e.,  £  an 

i—m  |+l 

Find 


•  The  specific  values  of  a,,  m,  +  1  ^  /  <  m, 

•  The  amount  to  be  shipped  from  each  origin  to  each  destination  so  as  to  minimize  the 
total  shipping  cost.  The  number  of  origins  in  the  optimal  solution  equals  the  number  of 
sources  /  for  which  x0  >  0  for  at  least  one  value  of  j.  It  is  expected  that  some  of  the 
o,,  /«]  +  1  <  /  <  m,  will  equal  zero,  i.e.,  some  of  the  warehouses  will  not  be  built. 

The  problem  described  is  reminiscent  of  the  location-allocation  problem  [1,2]  in  the  sense 
that  the  capacities  and  final  locations  of  the  additional  origins  are  not  fixed  in  advance.  How¬ 
ever,  rather  than  choosing  coordinates  optimally,  the  optimal  capacities  and  locations  are  indi¬ 
cated  by  the  values  of  a ,  which  are  positive  for  minimal  cost  schemes.  The  final  locations  are 
thus  limited  to  being  a  subset  of  the  original  set  of  potential  warehouses  of  size  m  —  m j. 
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As  in  the  first  application,  the  procedure  to  be  presented  relates  to  the  situation  in  which 

(a)  £ a i  =  =  R, 

i- 1  7-1 

ml  m 

(b)  —  aR,  £  a,  =  (1  -  a)R, 

i—l  i~m  |+1 

(c)  <  <  m, 

(d)  a  is  close  to  1 . 

The  first  mx  origins  are  assumed  to  be  operating  and  indeed  supplying  most  of  the  merchandise, 
m  \ 

since  -  aR.  Accordingly,  ax,  a2,  ....  am  will  be  assumed  to  be  known.  The  question 

/-I  1 

relates  only  to  the  exact  and  optimal  values  of  crm|,  am[+|)  ...  am. 

The  approach  to  be  taken  involves  the  solution  of  a  number  of  long  transportation  prob¬ 
lems.  In  particular,  consider  the  following  transportation  problems: 

•  A,  an  m  x  n  transportation  problem,  with  cu  equal  to  the  known  transportation  cost 
between  each  of  the  m  origins  and  n  destinations.  A  cannot  be  solved  initially  because  no 
specific  values  have  been  specified  for  amyJr{,  ...  am. 

•  tCj+1,  an  (ffi,  +  1)  x  n  transportation  problem  for  which 

clj  =  Cij,  i'=l,  . . .  m,  j  -  1,  ...  n, 
cmx  +  l.j  =  Gn|  +  lJ>  j  =  •  •  •  H, 

a,'  =  ah  i  -  1,  ...  mh 

,  m 

|+i  "  (1  a)R. 


•  A",  an  m  x  «  transportation  problem  identical  to  A  except  for  the  fact  that  the  last 

,  m 

m  -  (mi  +  1)  values  of  a,  are  set  to  0,  and  a„]+]  =  £  a,. 

i—m  j 


Clearly,  any  optimal  solution  A'm  + 1  also  optimizes  A". 


is  a  long  transportation  problem  which  can  be  solved  efficiently  as  previously  noted, 
thus  providing  a  solution  to  A".  But  the  relationship  between  A  and  A"  may  be  stated  as  fol¬ 
lows: 


a"  <  ah  /'=!,  ...  m,  i  ^  mx  +  l. 


“m.+l 


2*  ^ 

/—  mj+1 


It  follows  then,  from  Theorem  1,  that 

^  j + 1  j 
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and  from  the  note  following  Theorem  1  that 

(ii)  j£l+XJ  -  0  implies  xm|+lj  -  0  for  all  1  <  j  <  n. 


A  method  for  determining  zeroes  in  row  mx  4-  1  of  the  original  problem  has  been 
presented.  The  number  of  zeroes  determined  will  be  quite  large,  since 

n 

—  a„1+1  —  (1  —  a)R  has  been  stipulated  to  be  small, 
y-l 

In  general,  since  the  fraction  of  the  total  requirements  assigned  to  row  m,  equals 

— — — ^  ■■,  the  average  number  of  zeroes  in  row  mx  will  be  an ,  where  a  is  close  to  one. 

R. 

Zeroes  may  be  obtained  in  rows  +  2,  m i  +  3,  ...  m  by  following  an  identical  pro¬ 
cedure.  For  example,  Am)+2  would  also  be  an  (m,  +  1)  x  n  problem  which  differs  from  Am]+X 
only  regarding  the  transportation  costs  in  row  mx  +  1,  which  would  now  equal  cm)  +  2j.  In  gen¬ 
eral,  for  Ami+k,  k  =  1,  . .  ,  m  -  mu 

*  Cmx+k.j  J  =  n- 


Zeroes  have  thus  been  determined  for  rows  mx  4-  1,  . .  m.  As  far  as  the  first  mx  rows  are 
concerned,  a  similar  procedures  may  be  followed,  although  it  will  be  a  little  less  fruitful.  Again 
the  total  amount  of  merchandise  associated  with  the  lower  rows,  (1  -  a)R ,  will  be  combined. 
However,  instead  of  letting  a,'  =  (1  -  a)R  for  one  of  the  lower  rows,  (1  -  a)R  will  be  added 
to  one  of  the  first  m,  rows,  and  a,  will  be  set  to  zero  for  i  =  m i  +  1,  . .  m.  The  expected 


number  of  zeroes  in  row  i  =  1,  ...  mx  will  then  be 


aR  —  a, 

a, 

•  n  — 

a  —  — 

R 

R 

•  n. 


As  in  the  previous  application,  the  large  number  of  zeroes  obtained  enables  the  elimina¬ 
tion  of  numerous  rows  and  columns  of  the  original  tableau  A ,  and  the  subsequent  reduction  in 
size  of  the  remainder  of  the  problem.  To  solve  the  reduced  problem,  one  must  supply  values 
for  ami+(,  ...  am.  However,  at  this  point,  the  dimensions  of  the  problem  to  be  solved  are 
sufficiently  small  that  a  large  number  of  alternative  sets  of  values  for  {om|+)...am)  may  be 
experimented  with  until  a  satisfactory  solution  is  obtained. 


4.  AN  EXAMPLE 


The  utility  of  the  preceding  algorithms  can  be  demonstrated  only  by  systematically  experi¬ 
menting  on  large-scale  problems.  The  purpose  of  the  present  small-scale  example,  on  the  other 
hand,  is  to  review  and  illustrate  the  previously  described  techniques,  terminology,  and  notation. 
Consider  the  5  x  10  transportation  problem  which  appears  as  Table  1.  Note  that  most  of  the 
merchandise  is  concentrated  at  origin  1,  so  that  mx  =  1. 

m 

A'  will  be  the  1  x  10  transportation  problem  with  am]  =  £  a,  =  59,  and  the  optimal 
solution  to  A' appears  in  Table  2. 

The  set  Ax  includes  all  columns  of  A'  having  a  basic  variable  in  row  1.  In  the  present 
example,  A\  =  {1,  2,  3,  4,  5,  6,  7,  8,  9,  10|.  For  j  €  A\,  the  value  of  DX2(J)  will  now  be 
determined  (Table  3)  and  arranged  in  ascending  order  (Table  4). 
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TABLE  1 


TABLE  3 


Destination  7 

1 

2 

3 

B 

5 

6 

B 

8 

_ 

9 

10 

0.2(7) 

2 

4 

2 

-2 

-2 

6 

-1 

BB 

3 

TABLE  4 


Ku  «  (8,  4,  5,  7,  9,  1,  3,  10,  2,  6). 

S,  <  £  (a,  -  a,’)  -  59  -  51  -  8 

i-i 

•*«  —  9  >  S]  —  8. 

M-10 

Thus  s(l,  2)  —  10.  It  follows  that  x'y  —  0(1  <  j  <  10,  J  ^  6). 
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Similarly,  if  the  Dl3(j)  are  arranged  in  an  increasing  sequence  one  obtains  Table  5. 


TABLE  5 


Destination  j 

7 

4 

2 

8 

9 

1 

10 

5 

6 

3 

DnU) 

-6 

-4 

-3 

-1 

0 

1 

2 

2 

5 

6 

xo 

2 

3 

8 

9 

8 

5 

3 

4 

9 

8 

10 

£*,,  =  17  >  Si  <  8 
m-9  “ 

Thus  x'3j  =  0  for  j  —  1,2,  4,  5,  7,  8,  9. 

For  Dxi(j)  and  Dl3(j),  the  results  of  Tables  6  and  7  are  obtained. 


TABLE  6 


j 

7 

8 

— 

10 

3 

4 

9 

- 1 

2 

6 

5 

1 

X,j 

2 

9 

3 

8 

3 

8 

8 

9 

4 

5 

Thus  x\ j  =  0  for  j  =  2,  3,  4,  6,  7,  8,  9,  10. 


TABLE  7 


j 

8 

9 

4 

5 

7 

1 

10 

6 

3 

2 

*u 

9 

8 

3 

4 

2 

5 

3 

9 

8 

8 

.*5,  —  0  for  j  -  1,  4,  5,  6,  7,  8,  9,  10. 


The  results  of  the  present  ca'culations  are  arranged  in  a  5  x  10  table  (Table  8)  enabling 
the  reduction  of  the  original  problem  A  to  the  tableau  of  Table  9.  Note  that  the  second  means 
of  obtaining  zeroes  suggested  in  the  previous  section  is  not  applicable  here  since  the  variables 
in  row  m  i  =  1  are  all  positive. 


TABLE  8 


\  J 

i  \ 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

a, 

1 

3 

2 

9 

8 

3 

51 

2 

X 

X 

X 

X 

X 

1 

X 

X 

X 

X 

1 

3 

X 

X 

X 

X 

X 

X 

X 

X 

2 

4 

X 

X 

X 

X 

X 

X 

X 

X 

2 

5 

X 

X 

X 

X 

X 

X 

X 

X 

3 

5 

8 

8 

3 

4 

9 

2 

9 

8 

3 

59 
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1 

3 

4 


5 


a, 

In  Tables  8  and  9,  an  x  in  location  ( ij )  means  that  variable  (i.j)  has  been  determined  to 
be  equal  to  zero.  Note  that  the  original  values  of  a ,  and  b2  have  been  reduced  by  the  optimal 
values  of  Xy  which  have  already  been  determined;  e.g.,  instead  of  A]  — 51, 
zz ]  —  JC|4  —  X|7  -  jr18  -  jc,9  =  26  appears  in  Table  9.  Similarly,  b6  has  been  reduced  by  one. 
The  reduced  problem  is  of  dimension  4x5  rather  than  5x10. 

The  sample  problem  previously  described  will  be  used  to  demonstrate  the  second  applica¬ 
tion.  As  before,  a\  =  51.  Suppose  £o,  =  8  as  before,  but  the  exact  values  of  these  origin 

i-2 

requirements  are  unknown,  and  it  is  desired  to  determine  them  optimally. 

In  the  present  case,  the  number  of  known  values  of  a,,  i.e.  mu  equals  one.  =  A'2 

,  5 

is  formed  by  setting  a2  =  £a,  =  8,  and  appears  in  Table  10  together  with  its  optimal  solution. 

i-2 


TABLE  9 

1  2  3  5  6  bj 


r~ 

51  -  25  =  26 

X 

X 

X 

2 

X 

X 

X 

2 

X 

X 

X 

3 

5 

8 

8 

— 

4 

8 

_ 

TABLE  10 


«/  \ 

5 

8 

8 

3 

4 

9 

'  2 

9 

8 

3 

51 

_±J 

5 

JLl 

8 

_u 

8 

_JJ 

3 

_6j 

4 

_zJ 

4 

Lu 

2 

_2j 

9 

_lI 

8 

-Jj 

8 

_2J 

jlJ 

_5j 

_8J 

5 

[_4J 

_5J 

-JJ 

3 

/C|+J  —  Aj  is  formed  by  setting  aj  =  £a,  =  8,  and  appears  in  Table  11  together  with  its 

f-2 

optimal  solution,  as  do  A\  (Table  12)  and  A  j  (Table  13).  It  should  be  noted  that  the  same  ini¬ 
tial  solution  may  be  used  when  solving  each  of  the  auxiliary  problems. 


TABLE  11 


SI 

5 

8 

8 

3 

4 

9 

2 

9 

8 

3 

51 

EH 

BJ 

_7J 

3 

_Li 

3 

_6j 

4 

_u 

9 

_U 

2 

L JJ 

9 

1 

_±J 

8 

a 

a 

_JJ 

5 

_zJ 

r^J 

- 

JJ 

r JJ 

3 
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TABLE  12 


a,  \ 

5 

8 

8 

3 

4 

9 

2 

9 

8 

3 

51 

_4j 

oo|^ 

_ZJ 

8 

_ 1 J 

3 

Jil 

1 

9 

-l\ 

2 

_2 J 

9 

|_il 

8 

3 

8 

_oJ 

5 

IsT 

_§J 

_2j 

^1 

3 

_6J 

TABLE  13 


\  >>J 

«i\ 

5 

8 

8 

3 

4 

9 

— 

2 

9 

8 

3 

51 

_U 

5 

_ll 

8 

_zJ 

8 

_l! 

-Al 

4 

_zJ 

1 

-  , 

_il 

2 

_2J 

9 

-Ll 

8 

-Al 

3 

8 

_2J 

_oJ 

_ iJ 

-Ll 

_1 1 

-ll 

8 

-Al 

JlJ 

As  previously  explained,  the  zero  variables  in  rows  2  of  A'2,  A\,  A4  and  A  '5  will  remain  zero  in 
rows  2,  3,  4,  and  5,  respectively  of  the  original  problem  A,  independent  of  the  (unknown 
optimal)  values  of  a,.  The  information  presently  available  is  displayed  in  Table  14. 


x  represnts  a  variable  whose  value  al  optimality  is  known  to 
equal  zero. 


Columns  2,  4,  7,  8,  9  may  be  removed.  Row  5  may  also  be  removed.  The  missing  vari¬ 
able  in  row  5  will  equal  a5,  since  all  the  other  x5j  =  0.  The  reduced  4x5  problem  appears  in 
Table  15.  In  addition  to  the  zeroes  indicated  by  Table  14,  the  optimal  values  of  the  following 
variables  are  now  known:  x]2  “  8;  X|4  =  3;  Xn  =  2,  =  9,  x19  =  8,  x56  =  a5. 
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TABLE  15 


1  3  5  6  10 


JLJ 

7 

_6J 

-11 

_il 

21 

Jj 

X 

X 

_8j 

X 

_Ll 

Jj 

X 

_u 

-A] 

X 

_2J 

X 

as 

J0J 

_8j 

X 

_iJ 

X 

6 

X 

a 4 

5 

8 

4 

9-  a5 

_ 

3 

The  exact  values  of  the  a,  will  now  be  determined  by  trial  and  error.  What  has  been 
gained  is  that  instead  of  solving  a  5  x  10  tableau  many  times,  one  solves  a  4  x  5  tableau  many 
times.  The  additional  preliminary  work  of  solving  4  long  (2  x  10)  transportation  problems  is 
not  overly  burdensome,  and  in  addition  to  enabling  a  reduction  in  dimensionality,  it  provides 
knowledge  regarding  the  exclusion  of  certain  variables  in  the  reduced  tableau  from  the  final 
solution. 
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ABSTRACT 

We  consider  the  problem  of  searching  for  a  target  that  moves  in  discrete 
time  and  space  according  to  some  Markovian  process  At  each  time,  a  searcher 
attempts  to  detect  the  target.  If  the  searcher's  action  at  each  time  is  such  as  to 
maximize  his  chances  of  immediate  detection,  we  call  his  strategy  'myopic." 
We  provide  a  computationally  useful  necessary  condition  for  optimality,  and 
use  it  to  provide  an  example  wherein  the  myopic  strategy  is  not  optimal 


DESCRIPTION  OF  THE  PROBLEM 

Let  X  be  the  position  space  within  which  the  target  moves,  and  let  P(x,  1)  be  the  proba¬ 
bility  that  the  target  starts  at  position  *  at  time  1  (so  £  P(x.  1)  =  1).  In  general,  let  P(x,t) 

xtX 

be  the  probability  that  the  target  is  at  position  x  at  time  t  and  has  not  been  detected  by  any  of 

the  searches  at  times  1,  2 . t  -  1;  x  €  X,  1  <  f  <  m.  If  is  the  probability  that  a 

target  at  position  x  at  time  /  will  not  be  detected  at  time  r,  then  P(x,t)  -  P(x,t)  q(x,t)  is  the 
probability  that  the  target  is  at  x  at  t  and  has  not  been  detected  by  any  of  the  searches  at  times 
l . t.  If  q(x, i)  is  selected  to  minimize  £  P(x.i)  within  whatever  constraints  are  imposed 

xtX 

on  the  search,  then  the  search  is  myopic.  After  the  search  at  time  r,  the  target  moves  to  its 
position  at  time  t  +  1;  thus 

(1)  Piy.t  +  1)  -  £  P(x.t)  r Uy.th  y  e  X.  1  <  /  <  m  -  1, 

xfX 

where  r(.v,y,r)  is  the  probability  that  the  target  is  at  yat  t  +  1,  given  that  it  is  at  xat  t. 

The  problem  is  to  select  q(x.t),  x  €  X  and  1  ^  t  ^  m,  so  as  to  minimize 

sm  =  £  P(x.m),  where  s,„  is  just  the  probability  that  the  target  is  never  detected  at  any  time 

tE.r 

<  m.  We  will  introduce  a  necessary  condition  for  optimality  in  problems  of  this  sort  that  lends 
itself  to  successive  improvement  of  search  strategies,  and  will  give  an  example  showing  that  the 
myopic  strategy  can  be  improved. 

BACKGROUND 

Pollock  [9]  considered  a  special  case  of  this  problem  where  there  were  two  positions  and 
where  the  searcher  had  to  decide  which  one  to  search  at  each  time.  He  solved  it  using  an  appli¬ 
cation  of  dynamic  programming  that  is  not  easily  extended.  Dobbie  [4]  subsequently  solved  a 
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similar  problem  where  time  was  continuous,  rather  than  discrete,  lida  [8]  obtained  necessary 
and  sufficient  conditions  for  optimality  when  the  target’s  notion  is  discrete  but  not  necessarily 
Markov,  when  the  detection  function  is  regular  1121,  and  when  effort  is  infinitely  divisible.  He 
used  his  condition  to  solve  a  problem  where  there  were  five  essentially  different  paths  that  the 
target  might  follow.  Heilman  [7]  and  Saretsalo  [111  have  also  derived  necessary  conditions  for 
optimality  in  continuous  diffusion  and  in  general  Markov  motion,  respectively.  All  of  these 
conditions  differ  from  ours,  and  are  apparently  not  as  amenable  to  computation. 

The  author  became  interested  in  this  subject  after  reading  Heilman  [5,7],  who  showed 
that  the  effect  of  random  search  on  a  target  whose  motion  is  a  diffusion  is  to  introduce  an  extra 
term  into  the  diffusion  equation  for  the  probability  density.  After  making  a  discrete  approxima¬ 
tion  to  the  diffusion  equation,  the  author  based  an  algorithm  for  finding  the  optimal  effort  dis¬ 
tribution  on  a  necessary  condition  involving  the  adjoint  of  the  discrete  diffusion  equation,  and 
submitted  a  paper  on  the  technique  to  this  journal  in  1976. 

During  a  revision  of  this  paper.  Brown  ( 1  ]  appeared  and  gave  necessary  and  sufficient  con¬ 
ditions  for  optimality  in  the  case  of  arbitrary  target  motion  in  discrete  time  and  space  but  with 
an  exponential  detection  function.  Brown  derived  an  algorithm  for  optimizing  search  plans 
which,  like  his  conditions,  was  based  on  planning  search  at  a  given  lime  interval  using  a  target 
location  distribution  which  accounted  for  failure  to  detect  the  target  both  with  past  and  with 
future  search.  This  algorithm  is  based  on  the  simple  but  powerful  condition  involving  the  func¬ 
tion  that  is  Q(x.t.q)  in  this  paper,  representing  the  effect  of  future  search,  and  playing  the  same 
algorithmic  role  as  the  solution  of  the  adjoint  equation  in  the  author's  earlier  submission.  In 
the  present  paper,  we  observe  that  a  slight  generalization  of  Brown’s  condition  for  Markov 
processes  is  necessary  for  optimality  when  the  detection  function  is  arbitrary,  as  long  as 
glimpses  at  different  times  are  independent.  In  particular,  we  permit  the  restriction  that  all 
effort  must  be  placed  in  a  single  cell  at  each  opportunity,  which  is  sometimes  a  restriction  in 
practice. 

THE  NECESSARY  CONDITION 

To  emphasize  the  dependence  of  P(x.i)  on  the  search  strategy  </(•,•),  we  will  write 
P(x,t,q)\  however,  P(-,t,q.)  does  not  depend  on  qi-.r)  for  r  ^  i.  Define  Qix.t.q )  to  be  the 

probability  that  the  target  is  not  detected  by  any  of  the  searches  at  /  +  1 . /«,  given  that 

the  target  is  at  xat  time  r,  with  ■)  =  1.  Q(-,t,q)  does  not  depend  on  </(-.  r)  for  r  i. 

For  any  i  between  1  and  m,  the  probability  that  the  target  is  not  delected  by  any  of  the 
first  m  searches  is,  according  to  the  theorem  of  total  probability, 

(2)  v„.  =  £  P(x.t.q)  q(x.t)  Q(x.l.q). 

v  e  t 

Front  our  earlier  observations,  the  product  P(x.t.q)  Q(x.t.q)  does  not  depend  on  q(-.i).  Let 
S(q,t )  be  the  set  of  functions  /(•)  that  are  feasible  for  q(-,t)  when  </ (  .  -)  is  specified  for 
t  pi  i.  Then,  if  s,„  is  minimal,  it  must  be  true  that  ]T  P(x,i,q )  fix)  Q(x,t,q)  is  minimized 

x  C  V 

for  /  €  S(q.t)  when  ./(•)  =  </(•,/).  This  is  our  necessary  condition.  The  computational  useful¬ 
ness  of  the  condition  arises  from  the  fact  that  functions  of  position  only,  rather  than  position 
and  time,  are  involved  in  the  optimization.  Strategies  satisfying  the  necessary  condition  will  be 
refe,  red  to  as  "critical." 


V 


318 


A  R.  WASHBURN 


Repeated  passes  through  the  flow  diagram  will  result  in  a  succession  of  strategies,  each 
strictly  better  than  the  preceding  as  long  as  the  preceding  does  not  satisfy  the  necessary  condi¬ 
tion.  Before  each  pass,  the  function  Q(-,  \q)  must  be  computed.  This  can  be  done  recursively 
by  initializing  Q(-.m.q)  -  1,  x  6  X,  and  then  making  m  -  1  applications  of  Eq.  (3): 

(3)  Q(x,t  -  1  ,q)  -  £  r(xy,f  -  1)  q(y.t)  Q(y.t.q), 

>'  6  X 

x  €  X.  i  =  m,  m  —  1,  . . .  ,  2. 

MYOPIC  STARTING  POINT 

The  present  writer  and  also  Brown  [1]  have  found  that  one  has  to  be  ingenious  to  think  of 
examples  where  the  myopic  strategy  differs  significantly  in  terms  of  detection  probability  from 
strategies  that  satisfy  the  necessary  condition.  The  problem  in  the  next  section  will  provide  a 
typical  example  where  the  difference  is  small.  Based  on  this  experience,  it  is  reasonable  to  con¬ 
clude  that  the  myopic  strategy  provides  a  good  starting  point  for  the  iterative  procedure  dis¬ 
cussed  in  the  previous  section.  This  can  be  done  by  beginning  the  first  iteration  with  Q(-.  -.q) 
=  1. 

AN  EXAMPLE 

Let  X  be  the  first  67  integers  with  the  target  initially  on  number  34.  The  target  moves  in 
a  discrete  diffusion;  from  interior  points  it  goes  left  or  does  not  move  or  goes  right  with  proba¬ 
bilities  0.3,  0.4,  0.3,  respectively.  From  either  of  the  two  end  positions,  the  target  stays  where 
it  is  or  returns  to  the  closest  interior  point  with  probabilities  0.4  or  0.6,  respectively.  At  each 
opportunity,  the  searcher  picks  any  one  of  the  67  integers;  if  the  target  is  currently  at  that 
integer,  the  probability  that  it  will  not  be  detected  is  0.875,  or  otherwise  1.0.  There  are  80 
opportunities. 

If  the  searcher  were  to  pick  an  integer  at  random  at  each  opportunity,  the  probability  of 
nondetection  for  all  80  opportunities  would  be  (1  —  0.125/67)80  =  0.8612.  Successive  passes 
through  the  flow  diagram  in  Fig.  1,  with  Q(-.-.q)  =  1  initially,  produce  probabilities  of  non¬ 
detection  of  0.3664,  0.3641,  0.3629,  0.3626,  and  0.3625,  with  0.3664  corresponding  to  myopic 
search  (q,)  and  with  the  strategy  corresponding  to  0.3625  (q2)  being  critical.  q2  is  not  globally 
optimal  in  this  problem;  after  trying  some  different  starting  points,  another  strategy  iqj)  satisfy¬ 
ing  the  necessary  condition  with  nondetection  probability  0.3623  was  found.  A  globally  optimal 
search  strategy  for  this  problem  is  unknown. 

Figure  2  shows  (/,  and  contrasts  it  with  q2.  q2  is  somewhat  more  "spread  out”  than 
because  Q(  .  .q2)  is  smallest  for  values  of  x  near  the  center.  /*(■.  .q2)  and  CM  .  .</:>  can  be 
seen  in  Figs.  3  and  4.  Figure  5  contrasts  P(.v. 40 . </ 1 )  with  P(  v. 40 .</;).  The  myopic  strategy 
lends  to  cause  a  flat  spot  in  the  region  being  searched  that  is  missing  when  critical  strategies  are 
employed. 

EXTENSIONS  AND  POSSIBLE  EXTENSIONS 

Permitting  space  to  be  continuous  rather  than  discrete  is  simply  a  matter  of  replacing 
sums  by  integrals  and  [  (.v.i.r)  by  l’(.v.<A.r)  in  the  preceding  Permuting  time  to  be  continuous 
would  be  more  difficult,  since  one  would  have  to  begin  by  changing  the  definition  of  q(.x.i)  and 
abandoning  the  simple  step-by-step  proofs  that  are  possible  when  time  is  discrete.  We  will  not 
speculate  on  the  form  of  the  necessary  condition  in  this  case 
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I'igurf  4  The  I’unciion  Qix.t.q 2) 


The  necessary  condition  would  be  more  useful  if  it  were  also  sufficient.  Brown  [1]  has 
shown  that  it  is  in  fact  sufficient  when  the  detection  law  [12]  is  exponential  and  a  fixed  amount 
of  search  effort  is  available  at  each  time.  His  proof  would  hold  for  any  strictly  convex,  decreas¬ 
ing  detection  law. 

In  search  for  a  stationary  target,  the  myopic  strategy  is  optimal  for  both  maximizing  detec¬ 
tion  probability  at  a  fixed  time  and  for  minimizing  the  expected  time  to  detection  [12].  The 
myopic  strategy  can  be  implemented  by  providing  a  display  of  P(-.i.q)  to  an  operator  who 
presumably  selects  positions  x  at  which  P(x.t.q)  is  large  at  which  to  continue  the  search; 
computer-aided  search  schemes  such  as  [2]  function  in  this  manner  even  when  the  search  is  for 
a  moving  target.  Strictly  speaking,  one  ought  to  be  explicit  about  the  objective  function  when 
searching  for  a  moving  target,  since  myopic  search  is  not  generally  optimal  for  the  probability 
of  detection  in  a  fixed  time  problem  when  the  target  moves.  Whether  myopic  search  is  optimal 
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Figure  5.  Comparison  of  />(.v.40.</l)  with  P(x. 40.^  J. 


or  even  near  optimal  for  the  expected-time-to-detection  problem  is  unknown.  This  is  unfor¬ 
tunate,  since  expected-time-to-detection  is  a  reasonable  criterion  in  problems  where  the  target 
is  very  valuable  and  search  cost  is  proportional  to  time  spent  searching.  At  the  present  time,  to 
the  best  of  this  author’s  knowledge,  there  is  no  algorithm  applicable  to  the  problem  of  comput¬ 
ing  the  search  strategy  that  provides  the  minimal  expected  time-to-detection.  Construction  of 
such  an  algorithm  would  be  a  useful  extension. 
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ANALYSIS  OF  DATA  FROM  LIFE-TEST  EXPERIMENTS 
UNDER  AN  EXPONENTIAL  MODEL 
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ABSTRACT 

This  paper  discusses  situations  in  which  the  distribution  of  a  lifetime 
response  variable  T  is  taken  to  depend  upon  a  vector  x  of  regressor  variables. 
We  specifically  consider  the  case  in  which  T,  given  x,  has  an  exponential  distri¬ 
bution,  and  in  which  x  represents  levels  of  fixed  factors  in  an  experimental 
design.  Methods  of  analyzing  data  under  this  type  of  model  are  discussed,  with 
maximum  likelihood  and  least  squares  methods  being  presented  and  compared. 


1.  INTRODUCTION 

In  many  life-testing  situations  the  life  distribution  of  the  items  under  study  is  dependent 
upon  physical  and  environmental  factors.  For  example,  the  lifetime  of  a  capacitor  may  depend 
upon  the  voltage  and  temperature  the  capacitor  is  subject  to,  the  time  to  breakdown  of  a  type  of 
electrical  insulation  may  depend  on  voltage  as  well  as  certain  physical  characteristics  of  the  insu¬ 
lation,  and  so  on.  With  such  items,  life-test  experiments  are  often  carried  out  in  which  several 
factors  affecting  lifetime  are  varied  simultaneously.  This  paper  deals  with  the  analysis  of  data 
from  such  experiments.  We  deal  specifically  with  situations  in  which  the  life  distribution  of  the 
items  under  consideration  is,  at  fixed  environmental  conditions,  exponential.  We  also  assume 
that  the  scale  parameter  in  the  exponential  distribution  is  related  to  the  environmental  factors 
in  a  multiplicative  fashion.  That  is,  the  lifetime  T  of  an  item  depends  on  a  given  set  of 
environmental  factors  which  can  be  represented  by  a  (row)  vector  x.  The  distribution  of  T, 
given  x,  is  exponential  with  density 

(1)  Of'  exp (-t/Oj,  t  >  0, 

where  0X  =  expCxg)  =  expOc^  +  . . .  4-  xkf3k).  This  paper  deals  mainly  with  analysis  of  vari¬ 
ance  type  models,  in  which  0  depends  in  a  multiplicative  way  on  fixed  factors,  in  which  case  it 
is  often  convenient  to  write  the  model  in  a  form  (see  Eq.  (4))  analogous  to  that  used  in  normal 
theory  analysis  of  variance. 

The  model  described  above  covers  a  wide  variety  of  applications.  Before  proceeding  we 
briefly  note  some  examples. 

EXAMPLE  1 :  Zelen  [16,17]  has  considered  factorial  experiments  with  an  exponential 
model;  for  a  two-way  model,  for  example,  we  have  the  mean  life  0  depending  on  two  factors  A 
and  B,  having  a  and  b  levels,  respectively.  His  assumed  model  has 

(2)  By  -  majbjCij,  i  =  1 . a  ;  j  -  1,  ....  b. 
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where  the  parameters  are  subject  to  restrictions 

(3)  n  a,  —  n  b>  —  n  cu  —  n  c u  -  1. 

i—i  j- 1  1  i-i  1  j- 1  u 

This  model  can  be  written  in  the  form  (1),  with 

(4)  9,j  -  exp(/i  +  «,  +  j  +  yu), 

where  n  =  log  m,  a,  =  log  a,,  /3y  *  log  bJt  and  >/,  =  log  cu.  In  this  paper  we  generalize  and 
extend  Zelen’s  method  of  analysis  for  this  model. 

EXAMPLE  2:  Power  law  models  play  a  large  role  in  many  engineering  studies.  For 
example.  Nelson  [12]  and  Lawless  [9]  have  considered  data  on  time  to  breakdown  of  electrical 
insulating  fluid  under  constant  voltage  v.  The  assumed  model  involves  a  power  rule,  with 
9  -  cv~p;  letting  x  **  log  v,  this  can  be  expressed  in  the  form  (1),  with  9  —  exp(a  4-  px), 
where  a  —  log  c  and  P  —  -p. 

EXAMPLE  3:  All  four  models  discussed  on  pp.  421-422  of  Mann  et  al.  [11]  can  be 
expressed  in  the  form  (1).  For  example,  the  Generalized  Eyring  Model  [11,  p.  422]  has  O  '  = 
AT[t* p  (—B/kT)]  [exp  (CV  +  DV/kT)],  where  T and  Fare  environmental  stresses,  A,  B,  C, 
D  are  unknown  parameters,  and  k  is  Boltzmann’s  constant.  This  model  can  be  written  in  the 
form  (1)  with 

0  =  exp[-  log  A  -  log  T+  B(l/kT)  -  CV  -  D(  V/kT)]. 

Several  authors  have  considered  the  analysis  of  data  from  models  of  the  form  (1).  For 
example.  Lawless  [9],  Prentice  [13],  Singpurwalla  [14],  and  Kahn  [5]  discuss  the  particular 
form  of  Eq.  (1)  mentioned  in  Example  2  previously,  while  Lawless  and  Singhal  [10],  Prentice 
[13],  and  Singpurwalla  et  al.  [15]  discuss  models  (1)  with  more  than  one  regressor  variable 
present.  This  paper  deals  primarily  with  analysis  of  variance  type  models  such  as  Eq.  (4):  these 
can,  of  course,  be  treated  as  special  cases  of  Eq.  (1)  and  analyzed  in  this  way,  though  it  is 
sometimes  useful  to  dwell  on  the  particular  " fixed-effects"  form  of  Eq.  (4).  This  has  received 
less  attention  in  the  literature,  with  the  main  results  to  this  point  given  by  Zelen  [16,17],  In 
this  paper  we  extend  Zelen’s  methods  of  analysis  and  discuss  alternate  methods.  In  particular, 
Zelen  has  discussed  the  model  (4)  in  the  special  case  in  which  the  experiment  is  run  so  that  at 
each  combination  of  factor  levels  (ij)  only  the  first  r  failure  times  are  observed.  He  shows  how 
to  carry  out  likelihood  ratio  tests  concerning  various  factors  in  the  model,  and  how  to  obtain 
confidence  intervals  for  ratios  of  different  fly’s  and  b/s  in  Eq.  (2).  We  extend  his  work  by  con¬ 
sidering  the  case  where  the  first  rtJ  failure  times  are  observed  at  levels  Uj).  We  also  consider 
more  detailed  analyses  of  data  from  the  model  (4),  involving  the  examination  of  treatment  con¬ 
trasts,  for  example. 

2.  TWO-FACTOR  EXPERIMENTS 

The  remainder  of  the  paper  will  be  mainly  concerned  with  analyzing  data  from  a  two- 
factor  model  of  the  form  (4).  This  allows  us  to  discuss  all  relevant  features  of  proposed 
methods  of  analysis;  the  techniques  are,  however,  easily  used  with  other  models  of  the  form 
(I).  The  two-factor  model  has  previously  been  discussed  by  Zelen  [16,17],  who  gives  methods 
of  analysis  based  on  likelihood  ratio  tests.  We  extend  (and  correct,  in  one  instance)  his  work 
here,  and  also  discuss  alternate  methods  of  analysis. 

We  consider  experiments  in  which  there  are  two  factors  A  and  5,  having  a  and  b  levels, 
respectively,  with  experimentation  at  all  ab  factor  level  combinations.  At  the  levels  U.j  )  for  A 
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and  B,  n,j  items  are  simultaneously  put  on  test,  the  experiment  being  terminated  at  the  time  of 
the  r,;th  item  failure.  Let  tijk  be  the  k  th  failure  time  for  combination  ( ij )  ( k  -  l,...,rv);  the 
tiJk's  are  the  first  rtJ  ordered  observations  in  a  sample  of  size  n,j  from  an  exponential  distribu¬ 
tion  with  mean  fly  (/'  —  1,. .  .,a  \j  -  1,. .  .,b).  We  consider  the  model  (4)  where,  in  the  most 
general  case, 

(5)  log  By- H  +  otj  +  pj  +  yy. 

There  are  only  ab  independent  parameters  in  Eq.  (5),  and  for  convenience  we  take  the  linear 
restrictions  on  the  parameters  to  be 

(6)  X  r>  “<  “  Z  rjPj  ~  0 

i-t  ;-i 

a  b 

jL  Wu  =  0  (for  all  j).  X  Wu  “  0  (for  all  /). 

<-i  j-\ 

b  a  a  b 

Here  rt  =  X  ry’  rj  =  £  rU'  we  *ater  a*so  use  r  .  -  X  I  rv 
j- 1  i-i  /-I  j- 1 

It  is  well  known  [4]  that  the  quantities 

T,J  =  i  tijk  +  (t> ij  -  rij)tjjr 
k- 1 

are  sufficient  for  the  fly’s,  and  that  T{j  has  a  gamma  distribution  with  density 

T  r;j~ 1 

ATy-fiy)  = - - - 7*  exp(-7yfly). 

(r,j  ~  D!  A,/ 

The  likelihood  function  for  the  model  is 


L (fly’s)  -  n  n  /(TV,  fly) 
/-i  j- i 


and  the  log  likelihood  function  is,  except  for  an  additive  constant  which  we  omit  without  loss  of 
generality, 

(7)  !og  L  =  -  X  £  r,j  log  «rXI  W 

/-I  ;-l  /- 1  y- 1 

We  now  discuss  methods  of  analyzing  data  from  this  model.  We  begin  by  considering 
likelihood  ratio  tests  of  various  factors  in  the  model,  and  then  proceed  to  a  discussion  of  max¬ 
imum  likelihood  and  least  squares  procedures. 

3.  LIKELIHOOD  RATIO  TESTS  OF  EFFECTS 

In  analyzing  data  from  the  model  in  Section  2,  a  number  of  hypotheses  will  be  of  interest. 
These  include  hypotheses  of  "no  interaction"  between  factors,  and  hypotheses  concerning  "main 
effects."  To  formulate  these  ideas,  we  consider  a  set  of  five  hypothesized  models: 

H4  :  log  fly  -  m  +  «/  +  Pj  +  y ij. 

Hi :  log  fly  -  fi  +«,  +Pj. 

H2  ■■  log  fly  -  fl  +  Pj  • 


1  -  ■  I  ■■  1  ■ 
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// 1  :  log  9jj  —  fi  +  a,, 

H0  :  log  Ojj  =  ix. 

In  these  models,  the  a,’s,  p/s,  and  -y/y’s  satisfy  the  linear  restrictions  given  in  Eq.  (6). 
Preliminary  analysis  of  data  will  often  involve  testing  some  of  these  models  against  others.  For 
example,  to  test  for  "no  interaction"  between  A  and  B,  we  test  //3  vs  H4,  taking  H3  as  the  null 
and  H4  as  the  alternative  hypothesis.  In  general,  to  test  Hs  (null  hypothesis)  vs  H,  (alterna¬ 
tive),  we  can  use  the  generalized  likelihood  ratio  test  statistic 

*»  -  -  2  log  [Imax  (Hs)/ Lmax  (//,)] 

=  2(AS  -  A,). 

where  A,  =  -  log  Lmax  (HJ  is  minus  the  maximum  of  log  L  under  the  model  Hs.  Under  Hs, 
A  „  is  asymptotically  (i.e.,  as  the  r,'s  become  large)  x2  with  degrees  of  freedom  given  by  the 
difference  in  the  number  of  functionally  independent  parameters  in  H,  and  H,  (e.g.  [7]  Ch.  24). 

We  now  give  the  maximum  likelihood  estimates  (m.l.e.’s)  and  maximized  log  likelihoods 
necessary  for  carrying  out  likelihood  ratio  tests  for  models  H0  to  H4. 

H4.  Under  //4,  there  is  a  one-to-one  relationship  between  {0,y}  and  {ju.,  ah  /3y,  y,y),  deter¬ 
mined  by  Eqs.  (5)  and  (6).  It  is  well  known  [4]  that  the  maximum  likelihood  estimate  of  0,y  is 
9,j  =  Tij/r,j.  We  find  that  (henceforth  all  sums  over  i  are  from  1  to  a  and  over  j  are  from  1  to 
b ,  unless  stated  otherwise)  the  parameter  estimates  /l,  a,,  /3y,  and  -y,y  are  given  by  the  following 
equations: 

i)  A  ”  r,j  log  9U, 

r  .  .  /  j 

ii)  y,j  =  log  90  -  a,  -  Pj  -  jl,  where 

iii)  The  a/ s  and  p/ s  are  found  as  solutions  to  the  linear  equations 

(9)  X  ru  l°8  h  =  r>.  +  «<)  +  Z  ruPj-  '  =  1 . a’ 

j  J 

£  rtj  log  9ij  =  r j(fx  +  /3y)  +  rtJa„  j  =  1 .  6, 

/  / 

subject  to  the  restrictions  Z  r/  =  Z  r  jPj  =  0.  Comments  on  the  solutions  of  such  equa- 

I 

tions  are  given,  for  example,  by  Kempthorne  (6,  pp.  80-81).  From  the  m.l.e.’s  we  obtain 

A4  =  ZZ  fn  log  +  ZZ 

ii  ii 

=  r  .  .  fx  +  r  .  . 

Note,  in  particular,  that  in  order  to  obtain  A4,  it  is  not  necessary  to  obtain  anything  other  than 
fx.  This  is  noteworthy,  since  often  (see  following)  we  may  wish  to  test  //3  against  //4,  and  then 
to  discuss  only  H}  further,  if  H}  is  accepted. 


Hy  The  maximum  likelihood  estimates  of  ix,  a,,  and  P,  under  H}  cannot  be  given  in 
closed  form,  and  must  be  determined  numerically.  To  maximize  log  L  under  the  restriction  on 
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the  a/’s  and /3/s  we  consider 

log  L  -  -  r  .  .  ix  -  ££  T>j  exp(-  M  ~  «.  -  Pj)> 

i  j 

where  we  consider,  through  Eq.  (6),  that  aa  - - —  £  r,  a,  and  - - —  £  r  fir  That 

i&Q  f.b  j&b 

is,  ft,  a, . aa_,  and  /3, . /34_,  are  considered  as  the  unknown  parameters  and  aa  and  Pb 

are  given  in  terms  of  these.  Differentiation  of  log  L  gives  equations 

^ + XI  7i  e*p<- 0. 

(10)  -  -  X  /.j  exp(-  /*-«.-) 3;)  -  ~  X  ^  exp(-  i it  -  aa  -  /3.J. 

v®/  y  -  »a.  y 

/  —  1  ,  . . a  —  1, 

- ~  =  I  exP(~  M  “  <*/  "  0y)  ~  ~-  £  7;a  exp(-  /*-«/-  0*). 

VPj  t  r-b  i 

j=  1,  ....  A  —  1. 

These  equations  have  to  be  solved  iteratively.  This  is  easily  done,  for  example,  using  Newton’s 
method  [7,  Ch.  18].  We  note  that  again  in  this  case,  A3  is  of  the  form 

A  3  -/■..£  +  r  , 

where  /2  is  the  maximum  likelihood  estimate  of  ft  obtained  from  expression  (10).  To  see  this, 
observe  that  the  first  equation  in  expression  (10)  implies  II  Tij  exp (~  fi.  -  d/  -  Pj)  =  r  .  ., 

i  j 

so  that  substituting  the  m.l.e.’s  in  log  L  gives  A3  as  stated. 

H2.  In  this  case  we  need  to  consider 

log  L  -  -  r  .  .  ft  -  ££  TtJ  exp(—  ft  -  Pj ) 

i  j 

where  again,  using  Eq.  (6),  we  treat  j3Ip  ...,  pb- 1  as  the  unknown  parameters,  with  fib  — 
- —  I  r  fij.  It  is  easily  found  that  log  L  is  maximized  for  ft  and  p\,  ...  ,  pb  by  choosing 

r.b  j 

fL  =  £  rj  log (Tj/rj) 

r  .  .  j 

and 

Pt  -  \o%(TJrj)  -  fi,  j  -  1 . b. 

where  Tj  —  £  .  A2  then  once  again  equals  r  .  .  ft  +  r  .  .  . 

i 

H |.  We  similarly  find  here  that  maximizing  log  L  leads  to  estimates 
{t  -  log(7’,/r,) 

a,  -  \og(TJ r,  )  -  ft,  i  =  1 ,  a. 
where  T, i  —  £  7;y.  Then,  Aj  —  r  .  .  ft  +  r  .  .  . 
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H0:  In  this  case  we  find/2  =  !og(7.  Jr.  .)  and  A0  =  r.  .  /2  +  r.  . ,  where  T.  .  =  XX  7/ 

i  i 

Utilizing  the  expressions  given  above,  we  can  test  any  form  of  the  model  against  any 
other.  We  will  often  want  to  start  by  testing  // 3  vs  //4.  Acceptance  of  Hi  would  mean  that 
there  is  no  interaction,  in  a  sense,  between  factors  A  and  0,  and  this  would  make  it  meaningful 
to  consider  questions  about  the  a/s  and  0/s  individually.  If  H}  is  accepted  we  might,  for 
example,  then  want  to  examine  the  hypothesis  that  Factor  A  has  no  effect.  This  would  be  done 
by  testing  H2  vs  H2.  Similarly,  we  might  want  to  test  H\  vs  Hi.  The  likelihood  ratio  method  is 
a  convenient  way  to  do  this,  and  examples  of  the  above  tests  are  given  in  Section  6. 

REMARK:  Zelen  [17]  gives  likelihood  ratio  tests  like  those  discussed  here,  for  the  case 
in  which  all  r/s  are  equal.  It  should  be  noted  that  the  statistic  M3  given  on  p.  513  of  his  paper 
is  incorrect,  however,  as  is  the  expression  for  m3  on  p.  512.  This  error  arises  because  Zelen 
obtains  incorrect  expressions  for  his  parameter  estimates  under  model  //3.  In  fact,  he  uses 
closed-form  expressions  for  the  estimates,  though  as  noted  above,  it  is  not  possible  to  obtain 
these. 

4.  LARGE  SAMPLE  MAXIMUM  LIKELIHOOD  PROCEDURES 

Likelihood  ratio  procedures  are  convenient  for  testing  hypotheses  concerning  main  effects 
and  interactions  in  the  model  (5),  but  are  less  convenient  when,  for  example,  it  is  desired  to 
estimate  contrasts  involving  the  parameters  or  to  obtain  confidence  intervals  for  parameters. 
We  discuss  here  procedures  based  on  large  sample  properties  of  the  m.l.e.’s,  which  can  be  used 
to  test  overall  effects  in  the  model  or  for  estimation  purposes. 

We  begin  by  considering  the  full  model  (5),  in  which  case  the  log  likelihood  is,  trout  Eq. 
(7), 

log  Lip.,al.PJ,y,j)  -  -  XL  r'M  +  a,  +  0,  +  7  ,7)  “XL  T'j  exP(“  M  “  «/  “  0>  “  >/• 

i-lj-l  (-1/-1 

The  parameters  satisfy  the  linear  restrictions  (6).  In  what  follows  we  will  suppose  that  the  ab 
unknown  parameters  in  the  model  are  (i,a|„..,«H,  0t,. . .  .  0h_i,  and  y  tJ 

(/  -  1,  . . .  .a  -  l;y  —  1 . 6-1).  The  remaining  parameters  can  be  given  in  terms  of 

these  through  Eq.  (6). 

Using  standard  asymptotic  properties  of  m.l.e.’s,  the  covariance  matrix  of  the  limiting  nor¬ 
mal  distribution  for  (/2,a, . «o-b0i . 0«.-b7ii . yu-mt-i))  is  given  by  the  inverse  of 

the  matrix  of  negative  expected  second  deviatives  of  log  L.  This  is  found  here  to  be  of  the 
form 

| r..  0  0  0)“' 


|o  0  0  q 

where  the  estimates  are  written  in  the  order  given  above,  and  V  is  therefore  partitioned  so  that 
A  is  (o  -  1)  x  (a  —  1),  B  is  (b  —  1)  x  (b  -  1),  and  C  is  (a  -  1)  (b  -  1)  x  (a  —  1)  (b  -  1). 
Specifically,  the  entries  in  the  matrices  A,  B.  and  D  are 


(12) 


-  7  - 
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S,,= 


D,j.- 


4 

4 


log  L 

apfiPj 

-a2  log  l 


('  *  Bn  =  +  — - 


C  _  ri.  ^ gj  ^  r.jrab 

r.b  ra  ra.  rb 


The  expression  for  Cy  is  more  complicated  and  will  not  be  given  here,  since  we  do  not  require 
an  explicit  expression  for  it  in  the  sequel.  The  formula  for  Q  is,  however,  precisely  the  same 
as  the  expression  given  in  Kendall  and  Stuart  [8,  Section  35.79]  for  the  case  of  least  squares 
regression  with  unequal  replication.  We  also  note  that  in  the  "proportional  frequency”  case, 
when  rtJ  =  r,  r  Jr . .,  things  simplify  to  a  degree,  since  then  £(-32  log  L/3a,d/8;)  =  0,  so  that 
D  =  0. 


An  alternate  way  to  obtain  expression  (11)  is  by  analogy  with  least  squares  methods  for 
unbalanced  two-way  models.  Since  9,,  —  Ttl/rn  has  a  gamma  distribution,  log  9,,  has  a  log 
gamma  distribution  (e.g.,  see  Bartlett  and  Kendall  [2])  with  mean  and  variance  given  by 

(13)  £(log  9,j)  =  log  By  -  log  r,j  + 

Var(log0y)  =  i/Ary), 

where  </»(■)  and  i /»'(•)  are  the  digamma  and  trigamma  functions  (see,  e.g.,  Abramowitz  and 
Stegun  111).  These  functions,  incidentally,  have  series  representations 


(14) 


> Mty)  =  log  rtj  - 


1 

2ru 


1 

12  ^ 


ip'(rij)  =  ~~  +  ~  +  + 

r0  2r,f  6rj 


The  maximum  likelihood  estimates  of  /2,  a,,  4y,  Ty,  given  by  expressions  (8)  and  (9),  are 
linear  combinations  of  the  log  0,/s.  Hence  the  variances  and  covariances  of  these  estimates  are 
in  principle  readily  determined.  The  calculations  are  somewhat  tedious,  though  if  we  note  that 
the  calculations  required  are  just  those  required  in  the  unbalanced  two-way  analysis  of  variance 
model 


(15)  Y0  -  log  9/j  =  fi  +  a,  +  Pj  +  y u  +  eu, 


where  e,j  —  N( 0, 1/ rt] ) ,  then  we  can  make  use  of  standard  techniques  for  this  model  (see,  e.g., 
[6,  pp.  79-91]  or  [8,  Chapter  35]). 


The  statistic  for  testing  for  absence  of  interaction  (i.e.,  that  y{J  =  0,  /  =  1 . a\ 

7=1,  —  b)  is,  in  terms  of  expression  (11),  S0  —  n'  Cn ,  where  n  =  (yn,  -  -  - .  y 
y 2i ,  •  •  • ,  ■yio-ix*-!))-  This  turns  out  to  be  equal  to 

(16)  Sq  =£  £  ry  yfj, 

1  7-1 

where  ytJ  is  as  given  in  expressions  (8)  and  (9).  Asymptotically,  50  is  distributed  as  X&-i)(4-n 
if  H 3  is  true,  and  large  values  of  S0  give  evidence  against  H}. 

Variances  of  contrasts  involving  the  m.l.e.’s  can  be  determined  from  V.  In  addition,  tests 
concerning  the  «,’ s  and  /3y’s  can  also  be  carried  out.  However,  tests  concerning  the  a,' s  and 
pj's  are  unlikely  to  be  of  much  interest  except  when  the  interaction  terms  yu  are  0.  Therefore, 
in  discussing  these  quantities  we  will  suppose  that  we  are  in  situations  where  it  has  been  found 
suitable  to  work  with  the  "main  effects"  model  //j. 
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The  equations  for  obtaining  the  m.l.e.’s  under  model  H2  have  already  been  given  in 
expressions  (10).  Test  and  estimation  procedures  can  be  based  on  the  limiting  normal  distribu¬ 
tion  of  (ji.d,, _ da_ , j§, . £i>- )).  The  covariance  matrix  for  this  limiting  distribution  is, 

as  in  the  case  of  the  full  model,  obtained  from  the  expectations  of  the  second  derivations  of  log 
L.  In  this  case,  the  covariance  matrix  is  of  the  form 

t-i 


(17) 


V,  = 


0 

A  D 
D  B 


where  Vx 


is  dimension  (a  +  b  —  1)  x  (a  +  b  -  1),  and  A,  B ,  and  D  are  as  given  in  expres¬ 
sions  (12).  The  matrix  K,  is  readily  inverted  numerically  in  any  specific  application.  In  the 
proportional  frequency  design  mentioned  earlier,  in  which  case  rtJ  —  r,  rjr..,  we  have  D  =  0 
so  that 

A.’1  0  0 

(18)  F,  =  0  A-'  0 

0  O'  fl-'J 

Further,  it  can  be  shown  (e.g.,  see  Kendall  and  Stuart  [8,  p.  18])  that  A~x  and  B~]  have  entries 


A-± 

r, 


B“=  — 

r, 


r.. 

J_ 

r. . 


4/“ 


BV-~ 


\_ 

r.. 

J_ 

r.. 


( i  *  j), 
(i  ^  j)- 


In  the  general  case  (nonproportional  frequencies),  Vx  will  have  the  form 
-•  0  0 


(19) 


r.. 

0 

0 


E  F 
F  G 


Tests  of  the  models  //,  or  H2  vs  H}  can  be  carried  out  by  employing  the  limiting  normal 
distribution  of  the  m.l.e.’s.  To  test  H2.  log  9 (J  =  fi  +  p2  vs  H}  (that  is,  we  are  testing  a,  =  0 
within  the  model  //3),  we  consider  the  statistic 

S2  =  a  F  ]  a, 

where  a  —  (a, . aa-i)  is  the  m.l.e.  of  a  under  Hy  If  H2  is  true,  S2  is  approximately 

xL-n-  Similarly,  to  test  Hx  vs  H2  =  0),  we  consider 

Si-fi'  G-'jl 

which,  if  H\  is  true,  is  approximately  xh-u- 


Contrasts  involving  the  <*,’s  or  fi/s  are  also  easily  handled.  If  we  are  interested,  say,  in 
the  contrast  <p  -  /,<* ,  +  ...  /a_iaa_|,  then  we  consider  the  approximate  normal  distribution  of 
<#>  =  /,«,  +  ...  l„_xaa-\,  which  is  easily  obtained  from  the  approximate  joint  normal  distribu¬ 
tion  of  (a . ,a„-i).  In  particular,  has  a  limiting  normal  distribution  with  mean  <t>  and 

variance  £  E  J  ,  where  J  =  (l\ . 4-i)  - 

A  final  remark  is  that,  since  VarOog  9U )  is  exactly  equal  to  i and  only  asymptotically 
equal  to  l/r0,  it  may  bt  preferable  to  use  t//'(ry)  instead  of  ry  in  the  above  calculations,  unless 
the  ry’s  are  fairly  large.  In  this  case  V  will  be  the  exact  covariance  matrix  for  the  estimates, 
and  not  just  an  approximation,  although  this  does  not  of  course  imply  that  So  will  necessarily  be 
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more  closely  approximated  by  a  x2  distribution  than  before.  In  the  case  of  F,  use  of 
does  not  give  the  exact  covariance  matrix  of  the  m.l.e.’s,  so  it  is  doubtful  whether  the  above 
modification  is  worth  considering. 

5.  LEAST  SQUARES  ANALYSIS 

Least  squares  methods  can  be  conveniently  used  with  the  types  of  models  discussed  here. 
We  consider  this  only  briefly,  since  least  squares  analysis  is  essentially  the  same  as  the  approxi¬ 
mate  normal  analysis  using  the  model  (15)  described  in  Section  4. 

We  consider  the  distribution  of  log  Oy,  and  write 

(20)  log  6,j  =  log  e,j  -  log  r,j  +  e'jj 

where  has  a  log  gamma  distribution  with  mean  0 ( r(/ )  and  variance  0 '( r The  model  (20) 
can  be  rewritten  as 

(21)  Yjj  —  iOg  Q  jj  "P  €y’, 

where  Y{J  =  log  9tJ  +  log  r0  -  i//(r„)  and  =  t'tJ  -  i li(rtJ).  In  view  of  expressions  (14)  we 
note  that 

and  e,j  has  mean  0  and  variance  0 ' ( /"^ )  =  U ru  +  1/2 rj  +  . . .  .  Least  squares  analysis  is  carried 
out  by  considering  the  model  (21)  and  using  weighted  least  squares  with  weights  w,j  =  l/t//'(ry). 
Tests  of  hypotheses  can  be  carried  out  by  treating  the  e,/s  as  being  approximately  normally  dis¬ 
tributed.  We  observed  that  this  analysis  is  similar  to  the  approximate  normal  theory  analysis 
for  model  //4  described  in  Section  4,  except  that  rl}  is  replaced  by  w:J,  and  log  9  tJ  is  replaced  by 
log  90  +  log  r,j  -  0  ( ) .  The  covariance  matrix  for  the  least  squares  estimates  of 

M.  «i . a„_|,  /3i . y,, . y {a- 1)(6- 1)  is  given  by  expression  (11),  except  that  r0 

is  replaced  by  w0  in  Eqs.  (12).  This  method  and  the  approximate  normal  methods  of  Section  4 
do  not  differ  by  much,  unless  the  r,} ’s  are  fairly  small,  since  to  first  order  (14),  </>'(ry)  =  l/ry. 

For  more  details  on  weighted  least  squares  in  the  two-way  model,  we  refer  the  reader  as 
before  to  Kempthorne  [6,  pp,  79-91]  or  Kendall  and  Stuart  18,  Ch.  35]. 

6.  AN  EXAMPLE 

In  handling  data  from  the  model  (5),  we  have  two  main  choices  with  regard  to  overall  test 
of  models  //5  vs  H,:  one  choice  is  to  use  likelihood  ratio  tests,  the  other  to  use  tests  based  on 
the  approximate  normality  of  the  m.l.e.’s  or  least  squares  estimates  of  the  parameters.  Both 
methods  require  a  moderate  amount  of  computation.  For  example,  in  using  likelihood  ratio 
statistics  we  may  need  to  solve  equations  iteratively  to  obtain  m.l.e.’s.  If  we  use  least  squares 
estimates,  this  is  not  necessary,  though  we  do  have  to  solve  linear  equations  for  the  estimates. 
However,  when  we  use  a  statistic  such  as  S\  or  S2,  we  need  to  invert  matrices,  which  is  not 
required  for  the  likelihood  ratio  tests.  In  order  to  estimate  contrasts  for  parameters,  maximum 
likelihood  or  least  squares  methods  can  be  used.  In  both  cases  it  is  necessary  to  obtain  the 
covariance  matrices  V  or  V\,  and  to  invert  matrices.  The  least  squares  estimates  are  slightly 
more  easily  calculated,  if  model  H)  is  used,  though  they  are  also  slightly  less  efficient  than  the 
m.l.e.’s  (see  Section  7). 

We  will  now  demonstrate  the  use  of  the  procedures  in  an  example,  and  return  briefly  to  a 
further  discussion  of  the  merits  of  the  procedures  in  Section  7. 
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EXAMPLE :  We  consider  for  convenience  a  simulated  3x3  factorial  experiment  with  9:J 
—  exp(2i  +  2 j),  i  -  0,  1,  2  ;  j  -  0,  1,  2.  The  values  of  Ttj  and  r0  (in  brackets)  are  given 
below: 


0 

B 

1 

2 

T,  (r, ) 

0 

9.61 

(10) 

48.14 

(6) 

102.7 

(4) 

160.45 

(20) 

A  1 

46.88 

(6) 

281.7 

(4) 

463.7 

(2) 

792.28 

(12) 

2 

254.7 

(4) 

807.8 

(2) 

8431.0 

(2) 

9493.50 

(8) 

T,  (r,) 

311.19 

(20) 

1137.64 

(12) 

8997.4 

(8) 

10446.23 

(40) 

We  begin  by  testing  for  absence  of  the  interaction  term  in  the  model  log  9tJ  =*  n  +  a, 

+  / 3j  +  -y,y.  Using  the  likelihood  ratio  test  of  H3  vs  //4,  we  find  X34  =  2(A3  -  A4)  =  1.227. 

Comparing  this  with  percentage  points  of  the  distribution,  we  see  there  is  no  evidence 
against  //3,  and  hence  in  further  discussions  we  will  work  with  model  H}.  We  note  that  in 
order  to  calculate  X34  we  had  to  compute  the  m.l.e.  /I  under  model  Ha,  (see  Eqs.  (8)),  and  also 
the  m.l.e. ’s  of  fi,  fij  under  //3.  These  latter  quantities  are  found  by  solving  Eqs.  (10)  to  be 
/I  =  2.709,  a,  =  -  1.521,  a2  -  0.625,  «3  =  2.864,  /§,==  —  1.352,  /32  =  0.767,  and  /§3  = 
2.228. 

Alternately,  we  could  test  //,  vs  H4  by  using  the  large  sample  approximate  normal  distri¬ 
butions  of  either  the  m.l.e. ’s  or  the  least  squares  estimates.  Using  S0  given  by  Eq.  (16),  we 
find  S0  —  1.207,  which  is  in  close  agreement  with  the  likelihood  ratio  test  result.  Using  S0,  but 
using  the  least  squares  estimates  and  their  variances  instead  of  the  m.l.e. 's,  gives  So  ~  0.935, 
in  good  agreement  with  the  other  results.  The  least  squares  estimates  of  the  parameters  under 

model  //3  are,  incidentally,  /i  =  2.747,  d,  =  -  1.476,  d2  =  0.726,  d3  =  2.972,  /3,  =  -  1.316, 

/32  =  0.851,  and  /33  =  2.323. 


Effects  within  the  model  H3  can  be  examined  further  using  either  the  m.l.e.’s  or  least 
squares  estimates.  We  will  illustrate  here  the  use  of  the  m.l.e.’s;  as  noted  above,  least  squares 
methods  are  computationally  similar  to  these.  The  m.l.e.’s  of  the  parameters  have  been  given 
above.  We  also  require  the  covariance  matrix  Kf1,  which  is  readily  found  from  expressions 
(17)  and  (12)  to  be 


h- 


0.0250 

0 

0 

0 

0 


0  0 

0.0250  -0.0250 
-0.0250  0.0586 

0.0  0.0 

0.0  -0.0028 


0  0  1 

0.0  0.0 

0.0  -0.0028 
0.0250  -0.0250 
-0.0250  0.0586 


Overall  tests  of  the  main  effects  A  and  B  can  be  carried  out  using  either  statistics  S|  and  S2,  or 
the  likelihood  ratio  tests.  The  former  requires  the  inversion  of  some  2x2  matrices,  but  does 
not  require  any  new  estimates  to  be  calculated.  The  latter  method  requires  us  to  calculate  new 
m.l.e.’s  under  H2  or  H |,  but  no  matrix  inversion.  In  this  and  similar  problems,  the  amount  of 
computation  required  by  the  two  methods  is  comparable,  and  the  two  methods  will  usually  give 
results  in  fairly  close  agreement,  so  it  is  simply  a  matter  of  preference  which  procedure  one 
uses. 


Using  the  likelihood  ratio  test  for  H2  vs  //3,  we  find  X23  =  108.9,  and  for  H |  vs  H}  we 
find  X,,  =  74.6.  The  corresponding  values  of  S|  and  S2  are  116.4  and  83.3;  use  of  i//’(r,7) 
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instead  of  \/r:j  in  K,,  as  discussed  at  the  end  of  Section  4,  changes  these  only  marginally,  to 
117.2  and  83.4,  The  upper  0.01  percent  point  \h)  is  9.21,  so  that  both  tests  show  strong  evi¬ 
dence  in  favor  of  main  effects  for  both  A  and  B. 

If  desired,  contrasts  involving  the  parameters  can  also  be  examined.  For  example,  in  the 
present  situation  it  would  be  of  interest  to  consider  constrasts  which  correspond  to  linear  and 
quadratic  effects  in  A  and  B.  For  example,  for  A  we  might  consider  <j>LA  -  a3  -  a]  and  <t>QA  = 
«3  -  2a 2  +  «|.  Estimates  of  these  contrasts  and  their  variances  are  easily  obtained  from  d|, 
a2,  <5 3,  and  V\,  and  are  as  follows: 

$la  ~  «j  ~  «i  =  4.385,  Var(4>LA)  =  0.1755, 

4>qa  =  aj  —  2 d2  +  d]  =  0.0931,  Var(<j>QA)  —  0.1755. 

In  calculating  variances  of  4>La  and  we  make  use  of  the  restriction  20<*|  +  12a2  4-  8a3  = 
0  to  rewrite  4>tA  as  —  3.5d)  -  1.5d2  and  $qA  as  -  15d,  -  3.5d2.  Tests  of  <$>LA  =  0  and  <f>LA  — 
0  can  be  carried  out  by  treating  4>U  Vaii4>LA)  and  4>qa/  Var(<)>QA)  as  approximately  \hh  Here 
we  find,  that  <j>[4/  Var(<t> LA)  —  109.5,  and  4>qa/ Var(<j>QA)  —  0.017,  indicating  very  strong  evi¬ 
dence  in  favor  of  a  linear,  but  no  quadratic,  effect  on  log  9  for  factor  A.  (This  is  of  course  as 
we  expect,  since  the  simulated  data  come  from  the  model  log  9 „  =  2/  +  2 j.) 

In  a  model  in  which  there  is  no  interaction  effect,  it  may  also  be  of  interest  to  estimate, 
say,  a,  -  a,.  Note  that  a,  -  a,  =  log  9ik  -  log  6 jk  =  log  (9:k/9 Jk)  represents  the  log  of  the 
ratio  of  the  mean  lifetime  of  items  at  levels  /'  and  j  of  factor  A  (and  the  same  level  of  factor  B). 
For  example,  above  we  have  found  that  d3-d,  =  4.385;  if  we  desire  an  approximate 
confidence  interval  for  a3  -  ab  we  find  it  using  Var(<i>  LA )  *“  0-  1755,  so  that  an  approximate 
0.95  confidence  interval  for  «3  -  at  is  given  by  <j>LA  ±  1.96  V Var(4>LA).  In  this  case,  this 
yields  the  interval  3.564  <  a3  -  ai  <  5.206.  This  gives  confidence  limits  on  the  ratio  S3yt/Su 
of  mean  lifetimes  of  items  at  levels  1  and  3  of  A  as  35.30  <  93k/0lk  <  182.4. 

7.  ADDITIONAL  REMARKS 

The  procedures  described  are  all  easy  to  use.  The  least  squares  procedures  do  not  require 
any  iterative  solutions  for  estimates,  though  with  an  unbalanced  design  the  computation 
required  for  the  least  squares  analysis  is  not  much  less  than  that  required  by  the  maximum 
likelihood  or  likelihood  ratio  procedures.  The  maximum  likelihood  methods  are,  in  experi¬ 
ments  having  many  factor  combinations,  more  efficient  than  the  least  squares  procedures,  espe¬ 
cially  when  the  r,' s  are  not  large  [3].  All  the  procedures  described  rely,  however,  on  large 
sample  approximations,  and  so  the  question  arises  as  to  how  well  various  statistics  are  approxi¬ 
mated  by  their  limiting  x1  or  normal  distributions.  The  log  gamma  distribution  of  9 „ 
approaches  normality  fairly  rapidly  as  r,;  increases,  so  that  with  even  moderate  sized  r„’s  (say 
about  8  or  more),  the  approximations  employed  should  be  sufficiently  accurate  for  practical  pur¬ 
poses.  Zelen  [16]  has  performed  a  few  simulations  which  support  this,  though  unfortunately 
some  of  his  results  refer  to  an  incorrect  likelihood  ratio  statistic.  For  even  smaller  values  of  r,n 
the  approximations  should  be  reasonably  accurate,  though  in  this  case  x 2  approximations 
involved  with  likelihood  ratio  tests  might  be  expected  to  be  slightly  better  than  those  involved 
with  the  distributions  of  m.l.e.'s  or  least  squares  estimates. 

We  also  remark  that  although  we  have  restricted  our  discussion  to  a  two-way  model,  more 
general  analysis-of-variance  or  regression  models  can  be  handled  in  a  similar  way.  Hence  the 
large  number  of  models  which  are  of  the  form  (1),  with  log  9  a  linear  function  of  fixed  effects 
or  covariables,  can,  for  example,  be  readily  handled. 
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Finally,  no  work  on  the  exponential  distribution  is  complete  without  a  reminder  that  pro¬ 
cedures  based  on  the  exponential  model  are  rather  nonrobust.  That  is,  they  are  sensitive  to 
departures  from  exponentiality.  This  is  equally  true  of  the  maximum  likelihood  and  the  least 
squares  procedures.  Therefore,  in  using  procedures  such  as  those  discussed  in  this  paper, 
checks  need  to  be  made  on  the  adequacy  of  the  assumed  exponential  model. 
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ESTIMATING  THE  ECONOMIC  IMPACT  OF  THE  1973  NAVY  BASE 
CLOSING:  MODELS,  TESTS,  AND  AN  EX  POST  EVALUATION 
OF  THE  FORECASTING  PERFORMANCE 
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ABSTRACT 

In  1973  the  Defense  Department  made  plans  to  close  many  Navy  bases  in 
the  United  States.  Hardest  hit  was  Rhode  Island  which  would  suffer  a  loss  of 
45.6 1  %  of  the  total  cutback  of  42,812  jobs.  This  paper  describes  two  models 
which  were  built  to  forecast  the  severity  of  the  economic  impact  in  Rhode  Is¬ 
land:  one  used  the  reduced  form  equation  approach,  and  the  other  the  simul¬ 
taneous  equations  system  approach.  Tests  on  multicollinearity,  specification, 
and  serial  correlation  were  conducted.  An  ex  post  evaluation  of  these  two 
models’  performance  in  forecasting  then  concludes  the  paper. 


1.  INTRODUCTION 

On  April  16,  1973,  the  Defense  Department  announced  plans  to  close  or  modify  installa¬ 
tions  in  30  states,  the  District  of  Columbia,  and  Puerto  Rico.  It  would  eliminate  26,172  civilian 
and  16,640  military  jobs  by  June  30  of  the  next  year.  Hardest  hit  by  this  cutback  was  Rhode 
Island,  whose  Quonset  Point  Naval  Air  Station  would  be  shut  down  with  a  loss  of  3,809  civilian 
and  4,217  military  jobs,  and  whose  Newport  Naval  Base  would  be  severely  cut  down,  eliminat¬ 
ing  430  civilian  and  11,069  military  positions.  This  meant  that  Rhode  Island  would  suffer  a  pri¬ 
mary  job  loss  of  4,239  civilian  and  15,286  military  employments  with  a  total  of  19,525,  which 
was  45  61%  of  the  total  cutback  of  42,812  jobs.  This  announcement  was  greeted  in  Rhode 
Island  with  "shock,  anger,  and  bewilderment,"  and  was  ranked  to  be  as  catastrophic  as  "the 
disastrous  hurricanes"  experienced  by  the  state  in  the  past. 

A  study  was  conducted  in  the  summer  of  that  year  to  analyze,  to  assess,  and  to  forecast 
the  severity  of  the  economic  impact  of  the  Navy  base  closing  in  Rhode  Island.  The  results 
were  reported  as  a  headline  story  in  The  Providence  Sunday  Journal  [2]  on  August  5,  1973. 

Two  forecasting  models  were  built  for  this  economic  impact  study:  one  for  the  employ¬ 
ment  and  the  other  for  the  retail  sales  subject  to  taxation.  However,  only  the  employment 
model  will  be  discussed  here. 
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2.  IMPACT  MULTIPLIER,  REDUCED  FORM  EQUATION, 

AND  SINGLE  EQUATION  LEAST  SQUARES  METHOD 

2.1  Specification  of  the  Model 

In  the  employment  model,  a  five-equation  system  is  built  as  follows:  the  first  equation  is 
an  identity  (Eq.  (2.1)),  which  defines  the  total  nonagricultural  wage-and-salaried  employment 
(D  in  the  state  as  consisting  of  manufacturing  (A/)  and  nonmanufacturing  (AO  employment; 
the  latter  includes  Navy  civilian  workers  (C)  as  a  part  of  its  government  employment,  but  not 
Navy  military  personnel  (£>).  Thus,  nonmanufacturing  (AO  employment  is  equal  to  Navy  civi¬ 
lian  workers  (C )  plus  the  remaining,  which  we  shall  call  service  (S )  employment: 

(2.1)  T-M  +  N  =  M  +  S  +  C, 
where  N  —  S  +  C. 

Based  upon  past  research,  it  has  been  found  that  Rhode  Island  manufacturing  ( M ) 
employment  is  strongly  subject  to  cyclical  influence  but  her  nonmanufacturing  (AO  employ¬ 
ment  is  not.  Therefore,  it  is  hypothesized  that  in  Eq.  (2.2)  manufacturing  (A/ )  employment  is 
a  function  of  both  time  trend  (i)  and  a  cyclical  dummy  variable  (K),  using  1  as  downswing  and 
0  as  upswing: 

(2.2)  M  =  a  +  b  t  +  c  V  +  u2. 

In  Eq.  (2.3),  service  ( S )  employment  is  assumed,  following  the  economic  base  theory,  to 
serve  on  a  supportive  basis  for  the  state’s  total  (T)  employment  and  Navy  military  personnel 
(D).  Thus,  we  have 

(2.3)  S-  h  +  k  T  +  w  D  +  u3. 

Finally,  Navy  civilian  workers  (C)  and  military  personnel  (D)  are  taken  to  be  exogenous 
as  they  are  specified  in  Eqs.  (2.4)  and  (2.5): 

(2.4)  C  =  C, 

(2.5)  D-D. 


2.2  Estimation,  Testing,  and  Evaluation 


Since  the  immediate  interest  back  in  1973  was  the  impact  effect  of  the  Navy  base  closing, 
Eqs.  (2.1)  and  (2.2)  were  substituted  into  Eq.  (2.3)  and  solved  for  S  in  terms  of  /,  V,  C  and  D  : 

(2.6)  S  =  V  +  -0-  C  +  -2^-  D  +  -;+“3. 

1  -  *  l  -  k  l  -  k  1  -  k  l-  k  1  -  k 


Using  time  series  data  from  1961  to  1972,  this  reduced  form  equation  for  the  service  (S) 
employment  was  estimated  by  the  single  equation  least  squares  method.  It  turns  out  to  have  an 
annual  growth  factor  of  5.647,  and  a  recession  coefficient  of  3.928.  The  Navy  civilian  and  mili¬ 
tary  employment  impact  multipliers  are  estimated  to  be  0.499  and  0.216,  respectively.  All  these 
statistics  are  presented  in  Table  1. 


The  “f"  test  for  time  trend  (r)  is  found  to  be  statistically  significant  at  more  than  1% 
level;  for  cyclical  effect  (F),  significant  at  5%  level;  for  Navy  military  personnel  (D)^ significant 
at  about  30%  level;  and  for  Navy  civilian  workers  (C),  not  significant  at  all.  The  R2  is  0.993, 
and  the  calculated  Fwith  4  and  7  degrees  of  freedom  is  392.721. 
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TABLE  1  —  Estimation  of  Impact  Multiplier 
and  Reduced  Form  Equation 


Service  Employment  Equation* 

Variable 

Estimated  Coefficient 

t  statistic 

T-intercept 

147.444 

— 

Time 

5.647 

12.915 

One,  if  a  downswing 

-3.928 

-2.645 

Navy  civilian  workers 

0.499 

0.311 

Navy  military  personnel 

0.216 

1.018 

* R 2  -  0.993,  F{ 4.7)  -  392.721,  D.W.  =  2.049,  SEE.  -  1.639. 


To  compare  the  ex  ante  forecast  which  was  reported  by  Goodrich  [2]  against  the  ex  post 
forecast.  Tables  2  and  3  are  prepared. 


TABLE  2  —  Ex  Ante  Forecasting  of  Secondary  Job  Loss 
Based  on  Planned  Reduction 


Planned  Reduction 

Secondary 

Year 

Category 

Civilian 

Military 

Multiplier 

Job 

Workers 

Personnel 

Loss 

Civilian 

1973 

Workers 

Military 

2,518* 

0.499 

-1,257 

Personnel 

12,244t 

-2,645 

Total 

2,518 

12,244 

— 

-3,902 

Civilian 

1974 

Workers 

Military 

1,721* 

0.499 

-859 

Personnel 

3,042t 

0.216 

-657 

Total 

1,721 

3,042 

-1,516 

'Total  planned  reduction  in  Navy  civilian  workers  in  1973  and  1974  is  distributed  accord¬ 
ing  to  the  percentage  calculated  from  the  actual  reduction  in  1973  and  1974. 

7Same  as  above  except  it  is  related  to  Navy  military  personnel. 


TABLE  3  —  Ex  Post  Forecasting  of  Secondary  Job  Loss 
Based  on  Actual  Reduction 


Actual  Reduction 

Secondary 

Year 

Category 

Civilian 

Military 

Multiplier 

Job 

Workers 

Personnel 

Loss 

Civilian 

1973 

Workers 

Military 

3,504 

0.499 

-1,749 

Personnel 

18,523 

0.216 

-4,001 

Total 

3,504 

18,523 

— 

-5,750 

Civilian 

1974 

Workers 

Military 

2,395 

0.499 

-1,195 

Personnel 

4,609 

0.216 

-996 

Total 

2,395 

4,609 

— 

-2,191 

L 
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It  is  interesting  to  note  that  first,  the  actual  reduction  in  civilian  workers  and  military  per¬ 
sonnel  by  the  Navy  is  greater  than  the  announced  planned  reduction;  and,  consequently,  the  ex 
ante  forecast  of  the  secondary  job  loss  is  smaller  than  the  ex  post  forecast.  This  shows  once 
again  that  the  conditional  forecasting  is  different  from  the  unconditional  forecasting.  Second, 
among  the  many  other  forecasts  made  in  1973,  our  ex  ante  forecast  turned  out  to  be  the  most 
optimistic  one  in  the  sense  that  the  economic  impact  of  the  defense  base  closing  would  be  the 
least  severe.  Had  our  forecast  used  the  actual  reduction  figure,  it  would  have  sounded  more 
alarm.  But  events  proved  to  be  the  contrary.  For  instance,  in  1973  the  secondary  job  loss 
should  have  been  forecast  to  be  5,750  based  on  the  actual  reduction  figure  (not  3,902  based  on 
the  planned  reduction  figure),  and  the  ex  post  forecast  for  Rhode  Island  total  employment 
should  have  been  356,351;  but  the  actual  observed  total  employment  was  365,900,  and  the 
underestimation  was  9,549.  Thus,  we  have  to  conclude  that  either  Rhode  Island’s  inner 
economic  strength  is  actually  stronger,  or  the  economic  impact  of  the  defense  base  closing  is 
much  weaker  than  suspected,  or  both. 

Since  a  high  degree  of  multicollinearity  implies  that  at  least  one  of  the  explanatory  vari¬ 
ables  is  a  linear  function  of  one  or  more  of  the  remaining  explanatory  variables  plus  residual,  to 
run  the  test  for  multicollinearity  in  our  sample,  we  regress  each  of  the  explanatory  variables  on 
all  the  remaining  explanatory  variables.  The  functional  forms  and  the  R2  values  are  listed  in 
Table  4.  The  highest  R2  is  0.865,  which  may  be  taken  as  a  measure  of  the  multicollinearity  in 
our  sample. 


TABLE  4  —  Multicollinearity  Test  for 
the  Service  Employment  Equation 


Functional  Form 

R2 

c  =  gu.  y,  D) 

0.814 

D  =  hit,  V,  C) 

0.344 

t  =  /( V,  C,  D) 

0.865 

v  =  j(t,  C,  D ) 

0.005 

TABLE  5  —  Evaluation  of  the  Forecasting  Performance 
of  the  Reduced  Form  Equation 


Equation 

Year 

Forecast 

Value 

Actual 

Value 

Error 

Percent 

Error 

Service 

1973 

225.642 

235.191 

-9.549 

-0.041 

Employment 

1974 

225.171 

237.686 

-12.515 

-0.053 

1975 

230.918 

232.270 

-1.352 

-0.006 

1976 

240.414 

239.737 

+0.677 

+0.003 

Mean  Absolute  Error 

6.023 

— 

Mean  Absolute  Percent  Error 

— 

0.026 

Manufacturing 

Employment 

Not  Estimated 

Total 

1973 

356.351 

365.900 

-9.549 

-0.026 

Employment 

1974 

354.185 

366.700 

-12.515 

-0.034 

1975 

347.848 

349.200 

-1.352 

-0.004 

1976 

366.977 

366.300 

+0.677 

+0.002 

Mean  Absolute  Error 

6.023 

- 

Mean  Absolute  Percent  Error 

— 

0.016 

t 
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The  specification  error  tests  are  conducted  by  distinguishing  among  various  regression 
models.  Many  regressions  with  different  specifications  are  run.  Those  retained  here  seem  to 
have  the  correct  sign  and  magnitude. 

The  Durbin- Watson  statistic  is  calculated  to  be  2.049,  which  indicates  no  serial  correlation 
in  this  service  employment  equation. 

Lastly,  in  Table  5  we  evaluate  the  forecasting  performance  of  our  estimation  by  calculat¬ 
ing  the  difference  between  the  actually  observed  value  and  the  forecast  value  using  the  ex  post 
data  for  the  explanatory  variables.  This  yields  a  mean  absolute  error  of  6.023  and  a  mean  abso¬ 
lute  percent  error  of  2.6%  for  the  service  employment  equation,  whereas  for  the  total  employ¬ 
ment  equation  the  mean  absolute  error  is  6.023,  and  the  mean  absolute  percent  error  is  1.6%. 

3.  IMPACT  AND  LAGGED  MULTIPLIERS,  SIMULTANEOUS  EQUATIONS  SYSTEM, 
AND  TWO-STAGE  LEAST  SQUARES  METHOD 

3.1  Specification  of  the  Model 

More  recently,  a  simultaneous  equations  system  has  been  tested  with  a  partial  adjustment 
model  built  in  as  its  theoretical  basis.  Assuming  the  desired  level  of  the  manufacturing  employ¬ 
ment  ( M *)  in  the  state  in  a  year  is  a  linear  function  of  time  trend  (/),  cyclical  influence  00, 
and  cyclical  influence  lagged  by  one  year  ( K_,),  we  have 

(3.1)  M*  —  a  4-  b  i  +  c  V  +  d  V_x  +  w,. 

Due  to  institutional  rigidity,  technological  constraints,  and  other  reasons,  it  is  theorized  that  the 
actual  level  and  the  desired  level  can  only  be  adjusted  partially  as  follows: 

(3.2)  M  —  A/_!  =  k(M *  -  A/_,)  +  v, 
where  0  <  k  <  1 ,  and  v  is  a  random  disturbance. 

Solving  for  M*  in  Eq.  (3.2)  and  inserting  the  result  into  Eq.  (3.1),  we  obtain 

(3.3)  M  =  ak  +  bk  l  +  ck  V  +  dk  K_,  +  (1— A)  M_,  +  ( ku ,  +  v). 

Similarly,  we  work  out  the  service  employment  equation  as  the  following: 

(3.4)  S*  =  h  +  s{M  +  C)  +  m(M  +  C)_i  +  v>  D  +  r  t  4-  1/4, 

(3.5)  S-S_,  =  £(S*-S_,)  +  e, 

(3.6)  S  =  hg  +  sg(M  +  C)  +  mg(M  +  C)_i  +  wg  D  +  rg  t  +  (1  -  g)  5_]  +  (gu4  +  e). 

Equations  (3.3)  and  (3.6)  together  with  the  identity  (2.1)  constitute  our  second  model, 
which  is  a  simultaneous  equations  system  and  which  will  be  estimated  by  the  two-stage  least 
squares  method. 

3.2  Estimation,  Testing,  and  Evaluation 

Since  all  the  explanatory  variables  on  the  right-hand  side  of  Eq.  (3.3)  are  either  exo¬ 
genous  or  predetermined,  we,  using  the  same  time  series  data  but  one  year  less,  estimate  Eq. 
(3.3)  first  by  the  least  squares  method.  The  result  is  presented  in  Table  6(a). 
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TABLE  6  —  Estimation  of  Impact  and  Lagged  Multipliers  and  Simultaneous 

Equations  System 


(a)  Manufacturing  Employment  Equation* 

Variable 

Estimated  Coefficient 

t  Statistic 

K-intercept 

60.225 

— 

Time 

0.952 

1.516 

One,  if  a  downswing 

-8.351 

-3.065 

One,  if  a  downswing  lagged  1  yr. 

-5.487 

-1.033 

Manufacturing  employment  lagged  1  yr. 

0.471 

1.483 

•R2  -  0.707,  F(4,6)  -  7.023,  S.E.E.  -  2.831. 


(b)  Service  Employment  Equation* 

Variable 

Estimated  Coefficient 

t  Statistic 

T-intercept 

r  125.533 

— 

Calculated  manufacturing  employment 

and  Navy  civilian  workers 

0.377 

1.580 

Manufacturing  employment  and  Navy 

civilian  workers  lagged  1  yr. 

-0.237 

-0.977 

Navy  military  personnel 

0.098 

0.317 

Time 

5.105 

1.460 

Service  employment  lagged  1  yr. 

0.078 

0.112 

*R2  -  0.987,  F( 5,5)  -  151.535,  S.E.E.  -  2.089. 


In  Eq.  (3.6),  Mis  the  only  endogenous  variable  on  the  right-hand  side  of  the  equation,  so 
we  use  the  estimated  values  of  M  and  estimate  Eq.  (3.6)  by  the  least  squares  method  again. 
This  produces  the  statistics  in  Table  6(b). 

Finally,  using  the  estimated  regression  coefficients  obtained  from  Eqs.  (2.1),  (3.3),  and 

(3.6) ,  we  form  the  following  matrix  equation: 

(3.7)  WY,  »  AYr.,  +  BX,  +  O',.,  +  u„ 

where  W,  A,  B,  and  Care  estimated  regression  coefficient  matrices,^,  and  are  the  column 
vectors  of  the  endogenous  variables,  and  X,  and  X,~\  the  column  vectors  of  the  exogenous  vari¬ 
ables,  at  periods  t  and  / — 1,  respectively. 

Premultiplying  Eq.  (3.7)  by  1V~1,  we  have 
Y,  -  W~'AY,. ,  +  W~xBX,  +  W~'CX,^  +  W~xu„ 

(3  8>  “  +  +  +  V 

where  A*  -  W~XA,  B*~  W~XB,  and  C*  =  If  'C. 

Now,  lagging  the  matrix  Eq.  (3.8)  by  one  period  and  substituting  the  result  into  the  same 
matrix  equation  repeatedly,  we  obtain 

Y,  -  C4*)*_T,-*  +  B*X,(A*B*  +  C*)Al-i  +  A'lA'/r  +  C*U,-2 

(3.9)  +  (A*)7{A*B*  +  C*U,_  3  + . 

+  (A*)k-HA*B'  +  C*)X,-k+i  +  (A')k-'C'X^k. 

where  we  have,  of  course,  ignored  the  residual  term. 
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In  order  to  have  the  system  to  be  stable,  it  is  required  that  lim  (A*)k  =  0  when  k  — * 
and  all  the  characteristic  roots  of  A  *  be  <  1  in  its  absolute  value.  The  coefficients  of  B*  will 
then  serve  as  our  impact  multiplier  after  one-shot  increase  of  its  related  exogenous  variable,  the 
coefficients  of  ( A*B *  +  C*)  our  lagged  multiplier  lagged  by  one  period.  A* (A*B*  +  C*)  our 
lagged  multiplier  lagged  by  two  periods,  etc. 

In  our  case,  the  system  is  found  to  be  stable,  and  the  impact  multipliers  for  Navy  civilian 
worker  and  military  personnel  are  estimated  to  be  0.377  and  0.098,  respectively.  When  com¬ 
pared  to  the  impact  multipliers  of  0.499  and  0.216  from  the  reduced  form  equation  in  the  first 
model,  it  is  found  that  they  are  slightly  smaller,  and  seem  more  reasonable  as  the  aftereffects 
are  known  now. 

Those  lagged  multipliers  for  Navy  civilian  workers  lagged  by  one,  two,  and  three  years  are 
estimated  to  be  0.029,  0.0023,  and  0.00018,  while  those  for  Navy  military  personnel  are 
estimated  at  0.008,  0.0006,  and  0.000047,  respectively. 

The  secondary  job  loss  for  1973  and  1974  is  calculated  in  Table  7,  which  should  be  com¬ 
pared  to  Table  3.  The  impact  effect  in  1973  turns  out  to  be  smaller,  and  the  impact  and  lagged 
effects  are  also  smaller  in  1974.  The  advantage  of  this  second  model  is  then  the  capability  of 
calculating  both  the  impact  effect  as  well  as  the  lagged  effect  lagged  as  many  years  as  required. 


TABLE  7  —  Estimation  of  Secondary  Job  Loss  by  Impact  Multiplier  for  1973 
and  by  Impact  and  Lagged  Multipliers  for  1974 


—  -  -  — - ■ — — — - 

Actual  Reduction 

— 

Secondary 

Year 

Category 

Civilian 

Military 

Multiplier 

Job 

Loss 

Workers 

Personnel 

1973 

Civilian 

Workers 

03,504 

0.377 

-1,321 

Military 

Personnel 

18,523 

0.098 

-1,815 

Total 

3,504 

18,523 

— 

-3,136 

Civilian 

02,395(74) 

0.377 

-903 

1974 

Workers 

03,504(73) 

0.029 

-102 

Military 

04,609(74) 

0.098 

-452 

Personnel 

18,523(73) 

0.008 

-148 

Total 

05,899 

23,132 

-1,605 

To  demonstrate  the  lagged  effects  for  many  years,  we  calculate  the  cumulated  secondary 
job  loss  for  the  year  of  1976  in  Table  8. 

Following  the  same  test  for  muhicollinearity  as  we  did  before,  we  regress  each  of  the 
explanatory  variables  on  all  the  remaining  explanatory  variables  for  both  the  manufacturing 
employment  equation  and  the  service  employment  equation.  The  results  are  collected  in  Table 
9,  (a)  and  (b).  It  seems  that  the_degree  of  multicollinearity  is  high  in  the  manufacturing 
employment  equation  (the  highest  R2  -  0.867),  and  that  in  the  service  employment  equation 
it  is  even  higher  (the  highest  R 2  =»  0.998).  This  is  so  because  of  the  fact  that  time  trend  (t) 
plays  a  more  significant  role  in  the  latter  equation  than  in  the  former  equation. 

To  test  the  specification  error,  we  run  many  regressions  with  different  specifications  again. 
The  estimated  Eqs.  (3.3)  and  (3.6)  again  seem  to  have  the  correct  sign  and  magnitude. 
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Next,  we  run  the  special  test  for  serial  correlation  as  given  by  Durbin  |1],  as  in  our  least 
squares  regression  some  of  the  regressors  are  lagged  dependent  variables.  However,  the  h 

TABLE  8  —  Estimation  of  Secondary  Job  Loss  by  Impact  and  Lagged  Multipliers  for  7976 


Category 

— 

Actual  Reduction 

Multiplier 

Secondary 

Job 

Loss 

Civilian  Military 

Workers  Personnel 

67(76) 

0.3771 

-25 

Civilian 

+  216(75) 

0.2943E-1 

+  6 

Workers 

2,395(74) 

0.2296E-2 

-5 

3,504(73) 

0.1791E-3 

-1 

211(76) 

0.9822E-1 

-21 

Military 

34(75) 

0.7663E-2 

-0 

Personnel 

4,609(74) 

0.5979E-3 

-3 

18,523(73) 

0.4665E-4 

-1 

Total 

5,750  23,377 

— 

-50 

TABLE  9  —  Multicollinearity  Tests  for  the  (a)  Manufacturing 
and  (b)  Service  Employment  Equations 


Functional  Form  For  (a) 

R7 

t  =  g(K  K_„  AL,) 

0.858 

V  =  h(t.  V~\,  M_i) 

0.240 

K_i  =  iit,  K  M_,) 

0.867 

M-\  -  jit.  V,  V~\) 

0.781 

Functional  Form  For  (b) 

R2 

Mc  +  C  -  g((M  +  CL,.  D,  t,  S -,) 

0.720 

(M  +  C)-i  =  h  ((Mr  +  C),  D,  t,  5-,) 

0.801 

t  =  i((Mf  +  C),  (M  +  C)-,,  D,  5_,) 

0.997 

D  =  ji(Mc  +  C),  (M  +  C)-|,  t.  5-,) 

0.452 

S_)  -  k  ((Mc  +  C),  (M  +  CL).  D.  t ) 

0.998 

statistic  cannot  be  calculated  because  nV(b |)  >  1,  where  b\  is  the  regression  coefficient  of  the 
lagged  dependent  variable.  Following  the  procedure  given  at  the  same  source,  we  run  the 
regression  of  e,  (the  residual  at  period  t )  on  _  j .  t,  V,  and  A/_  t  for  the  manufacturing 
employment  equation,  and  find  that  the  t  statistic  for  e_)  is  not  significant  at  the  1%  level. 
However,  when  we  run  the  regression  of  e,  on  e,_x,  ( Mc  +  C),  (M  +  CL),  D,  t ,  and  5_(  for 
the  service  employment  equation,  the  r  statistic  for  e,_i  is  significant  at  the  5%  level  but  not  at 
the  1%  level.  Thus,  it  is  concluded  that  serial  correlation  is  not  present  in  the  manufacturing 
employment  equation,  but  is  suspected  in  the  service  employment  equation. 

Lastly,  the  forecasting  performance  of  our  second  model  is  evaluated  in  Table  10.  The 
mean  absolute  errors  for  the  service,  manufacturing,  and  total  employment  equations  are  found 
to  be  5.059,  3.333,  and  6.133,  and  the  mean  absolute  percent  errors  are  0.021,  0.028,  and 
0.017,  respectively.  They  are  extremely  small  again. 
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TABLE  10  —  Evaluation  of  the  Forecasting  Performance 
of  the  Simultaneous  Equations  System 


Equation 

Year 

Forecast 

Value 

Actual 

Value 

Error 

Percent 

Error 

1973 

23G.672 

235.191 

-4.519 

-0.019 

1974 

232.798 

237.686 

-4.888 

-0.021 

Service 

1975 

236.991 

232.270 

+4.721 

+0.020 

Employment 

1976 

245.846 

239.737 

+6.109 

+0.025 

Mean  Absolute  Error 

5.059 

— 

Mean  Absolute  Percent  Error 

— 

0.021 

1973 

128.886 

124.300 

+4.586 

+0.037 

1974 

123.747 

125.000 

-1.253 

-0.010 

Manufacturing 

1975 

119.542 

112.700 

+6.842 

+  0.061 

Employment 

1976 

123.052 

122.400 

+0.652 

+  0.005 

Mean  Absolute  Error 

3.333 

— 

Mean  Absolute  Percent  Error 

— 

0.028 

1973 

365.967 

365.900 

+0.067 

+  0.000 

1974 

360.559 

366.700 

-6.141 

-0.017 

Total 

1975 

360.763 

349.200 

+  11.563 

+  0.033 

Employment 

1976 

373.061 

366.300 

+  6.761 

+0.018 

Mean  Absolute  Error 

6.133 

— 

Mean  Absolute  Percent  Error 

— 

0.017 

4.  CONCLUSIONS 

Based  upon  the  preceding  study,  it  is  concluded  that 

•  An  ad  hoc  economic  impact  study  should  be  conducted  ror  the  state  or  region  involved 
whenever  there  is  a  defense  basr  sing.  The  data,  variables  and  models  should  be  directly 
related  to  the  peculiar  econorr  .id  demographic  conditions  of  the  state  or  region.  The 
current  practice  by  the  Defense  L  partment  of  using  an  employment  multiplier  of  2.585  for  the 
civilian  workers  and  an  employment  multiplier  of  0.662  for  the  military  personnel,  which  were 
estimated  from  a  cross-section  of  data  of  15  defense  base  closings,  should  be  avoided  (see 
Lynch  [4]).  In  our  1973  case,  there  is  no  doubt  now  about  the  very  unrealistic  forecasts  made 
from  these  employment  multipliers. 

•  If  resources  are  in  short  supply,  a  simple  reduced  form  equation  which  gives  the 
impact  multiplier  only  may  be  worked  out;  otherwise,  a  simultaneous  equations  system  should 
be  estimated.  The  latter  presents  not  only  the  impact  multiplier  but  also  the  lagged  multipliers 
lagged  as  many  years  as  desired. 

•  It  is  often  criticized  that  the  above  study  contains  a  large  measure  of  multicollinearity 
which  renders  the  research  useless.  As  it  is  argued  by  Kmenta  (3],  multicollinearity  is  "a  ques¬ 
tion  of  degree  and  not  of  kind."  In  most  of  the  real  world  data,  it  seems  that  there  is  always 
some  degree  of  multicollinearity  present  in  the  sample.  As  long  as  our  experiments  cannot  be 
controlled,  multicollinearity  seems  to  be  unavoidable  in  any  social  or  economic  environment. 
Further,  since  our  purpose  is  focused  on  forecasting  rather  than  explanation,  it  seems  that  as 
long  as  the  forecast  is  accurate,  multicollinearity  is  of  secondary  importance. 
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•  Further  research  may  be  carried  out  by  disaggregating  the  manufacturing  and  service 
employment  into  various  two-digit  SIC  groups  or  some  particular  industries  in  the  state,  which 
are  closely  related  to  the  defense  base  closings,  if  desired. 

•  If  time,  financial  resources,  and  data  at  the  state  level  are  available,  a  more  sophisti¬ 
cated  model  of  various  employments  may,  based  upon  the  neoclassical  theory  of  the  firm,  be 
specified  by  introducing,  say,  the  constant-elasticity-of-substitution  (CES)  production  function 
with  efficiency,  distribution,  substitution,  and  return-of-scale  parameters  included,  if  appropri¬ 
ate.  Then,  another  question  arises:  in  view  of  the  high  forecasting  performance  of  this  simple 
model,  is  the  benefit  gained  by  economic  sophistication  worth  the  cost;  or,  in  other  words,  will 
the  economically  sophisticated  model  perform  better? 
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